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RECOMBINANT TUMOR SPECIFIC ANTIBODY AND USE THEREOF 

5 RELATED APPLICATIONS 

This application claims the benefit of and priority to U.S.S.N. 60/288,564, filed 
May 3, 2001 , the disclosure of which is incorporated by reference herein. 

FIELD OF THE INVENTION 

10 The invention relates generally to recombinant antibodies. More particulary, the 

invention relates to recombinant antibodies that specifically bind human Epithelial Cell 
Adhesion Molecule, and to their use as diagnostic, prognostic and therapeutic agents. 

BACKGROUND OF THE INVENTION 

15 

There has been significant progress in the development of antibody-based 
therapies over the years. For example, investigators have identified not only a variety of 
cancer-specific markers but also a variety of antibodies that bind specifically to those 
markers. Antibodies can be used to deliver certain molecules, for example, a toxin or an 
20 immune stimulatory moiety, for example, a cytokine, to a cancer cell expressing the 

marker so as to selectively kill the cancer cell (see, e.g., U.S. Patent Nos. 5,541,087; and 
5,650,150). 

The KS-1/4 antibody is a mouse-derived monoclonal antibody directed against 
human epithelial cell adhesion molecule (EpCAM). EpCAM is expressed at very low 
25 levels on the apical surface of certain epithelial cells. For example, EpCAM is expressed 
on intestinal cells on the cell surface facing toward ingested food and away from the 
circulation, where it would not be accessible to most proteins and cells of the immune 
system (Balzar et al [1999] J. Mol. Med. 77:699-712). 



30 



Under certain circumstances, however, EpCAM is highly expressed on certain 
cells, for example, tumor cells of epithelial origin. Typically, these tumor cells have lose 
their polarity with the result that EpCAM is expressed over the entire surface of the cell. 
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Thus, EpCAM is a convenient tumor-specific marker for directing antibody-based 
immune-stimulatory moieties to tumor cells (Simon et al [1990] Proc. Nat. Acad. Sci. 
USA 78:2755-2759; Perez et al. [1989] J Immunol. 142:3662-3667). 

However, antibodies can have an associated immunogenicity in the host mammal. 
5 This is more likely to occur when the antibodies are not autologous. Consequently, the 
effectiveness of antibody-based therapies often is by an immunogenic response directed 
against the antibody. The immunogenic response typically is increased when the 
antibody is derived in whole or in part from a mammal different than the host mammal, 
e.g., when the antibody is derived from a mouse and the recipient is a human. 
1 0 Accordingly, it may be helpful to modify mouse-derived antibodies to more closely 
resemble human antibodies, so as to reduce or minimize the immunogenicity of the 
mouse-derived antibody. 

Although a variety of approaches have been developed, including, for example, 
chimeric antibodies, antibody humanization and antibody veneering^Accordingly, there is 
15 a need in the art for antibodies that bind to cancer specific markers and that have reduced 
immunogenicity when administered to a human. Further, there is a need in the art for 
antibodies that deliver toxins or immune stimulatory moieties, for example, as fusion 
proteins or immune conjugates to a cancer specific marker to selectively kill the tumor 
cell. 

20 SUMMARY OF THE INVENTION 

The present invention is based, in part, upon the identification of recombinant 
antibodies that specifically bind human EpCAM but are less immunogenic in humans 
than the template, murine anti-EpCAM antibodies. In particular, the invention provides 
recombinant KS antibodies in which the amino acid sequences defining one or more 
25 framework regions and/or complementarity determining regions have been modified to 
reduce their immunogenicity in humans. 

As used herein, the terms "antibody" and "immunoglobulin" are understood to 
mean (i) an intact antibody (for example, a monoclonal antibody or polyclonal antibody), 
(ii) antigen binding portions thereof, including, for example, an Fab fragment, an Fab' 
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fragment, an (Fab')2 fragment, an Fv fragment, a single chain antibody binding site, an 
sFv, (iii) bi-specific antibodies and antigen binding portions thereof, and (iv) multi- 
specific antibodies and antigen binding portions thereof. 

As used herein, the terms "bind specifically," "specifically bind" and "specific 
5 binding" are understood to mean that the antibody has a binding affinity for a particular 
antigen of at least about 10 6 M" 1 , more preferably, at least about 10 7 M" 1 , more preferably 
at least about 10 8 M" 1 , and most preferably at least about 10 10 M" . 

As used herein, the terms "Complementarity-Determining Regions" and "CDRs" , 
are understood to mean the hypervariable regions or loops of an immunoglobulin variable 
region that interact primarily with an antigen. The immunoglobulin heavy chain variable 
region (Vh) and immunoglobulin light chain variable region (V L ) both contain three 
CDRs interposed between framework regions, as shown in Figure 1 . For example, with 
reference to the amino acid sequence defining the immunoglobulin light chain variable of 
the of the KS-1/4 antibody as shown in SEQ ID NO: 1, the CDRs are defined by the 
amino acid sequences from Ser24 to Leu33 (CDR1), from Asp49 to Ser55 (CDR2), and 
from His88 to Thr96 (CDR3). With reference to the amino acid sequence defining the 
immunoglobulin heavy chain variable region of the KS-1/4 antibody as shown in SEQ ID 
NO: 2, the CDRs are defined by the amino acid sequences from Gly26 to Asn35 (CDR1), 
from Trp50 to Gly66 (CDR2), and from Phe99 to TyrlOS (CDR3). The corresponding 
CDRs of the other antibodies described herein are shown in Figures 1A-1C after 
alignment with the corresponding KS-1/4 heavy or light chain sequence. 

As used herein, the terms "Framework Regions" and "FRs" are understood to 
mean the regions an immunoglobulin variable region adjacent to the Complementarity- 
Determining Regions. The immunoglobulin heavy chain variable region (V H ) and 
25 immunoglobulin light chain variable region (V L ) both contain four FRs, as shown in 
Figure 1 . For example, with reference to the amino acid sequence defining the 
immunoglobulin light chain variable of the of the KS-1/4 antibody as shown in SEQ ID 
NO: 1 , the FRs are defined by the amino acid sequences from Glnl to Cys23 (FR1 ), from 
Trp34 to Phe 48 (FR2), from Gly56 to Cys87 (FR3), and from Phe97 to Lysl06 (FR4). 



10 
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With reference to the amino acid sequence defining the immunoglobulin heavy chain 
variable region of the KS-1/4 antibody as shown in SEQ ID NO: 2, the FRs are defined 
by the amino acid sequences from Glnl to Ser25 (FR1), from Trp36 to Gly49 (FR2), 
from Arg67 to Arg98 (FR3), and from Tip 106 to Serl 16 (FR4). The FRs of the other 
5 antibodies described herein are shown in Figures X and Y after alignment with the 
corresponding KS-1/4 heavy or light chain sequence. 

As used herein, the term "KS antibody" is understood to mean an antibody that 
binds specifically to the same human EpCAM antigen bound by murine antibody KS-1/4 
expressed by a hybridoma (see, for example, Cancer Res. 1984, 44 ((2):681-7). The KS 

10 antibody preferably comprises (i) an amino acid sequence of SASSSVSY (amino acids 
24-31 of SEQ ID NO: 1) defining at least a portion of an immunoglobulin light chain 
CDR1 sequence, (ii) an amino acid sequence of DTSNLAS (amino acids 49-55 of SEQ 
ID NO: 1) defining at least a portion of an immunoglobulin light chain CDR2 sequence, 
(iii) an amino acid sequence of HQRSGYPYT (amino acids 88-96 of SEQ ID NO: 1) 

15 defining at least a portion of an immunoglobulin light chain CDR3 sequence, (iv) an 

amino acid sequence of GYTFTNYGMN (amino acids 26-35 of SEQ ID NO: 2) defining 
at least a portion of an immunoglobulin heavy chain CDR1 sequence, (v) an amino acid 
sequence of WINTYTGEPT Y AD (amino acids 50-62 of SEQ ID NO: 2) defining at least 
a portion of an immunoglobulin heavy chain CDR2 sequence, or (vi) an amino acid 

20 sequence of SKGDY (amino acids 101-105 of SEQ ID NO: 2) defining at least a portion 
of an immunoglobulin heavy chain CDR3 sequence, or any combination of the foregoing. 

In one aspect, the invention provides a recombinant antibody that specifically 
binds EpCAM, wherein the antibody comprises an amino acid sequence, a portion of 
which defines a framework region in an immunoglobulin V L domain. In one 

25 embodiment, the framework region (FR1) is defined by amino acid residues 1-23 of SEQ 
ID NO: 5, wherein Xaal is Q or E, Xaa3 is L or V, XaalO is I or T, Xaal 1 is M or L, 
Xaal3 is A or L, Xaal 8 is K or R, or Xaa21 is M or L, provided that at least one of the 
amino acid residues at positions Xaal, Xaa3, XaalO, Xaall, Xaal 3, Xaal 8, or Xaa21 is 
not the same as the amino acid at the corresponding position in SEQ ID NO: 1. The 

30 amino acids at each of the positions are denoted by the standard single letter code. 
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In another embodiment, the framework region (FR2) is defined by amino acid 
residues 34-48 of SEQ ID NO: 5, wherein Xaa41 is S or Q, Xaa42 is S or A, Xaa45 is P 
or L, or Xaa46 is W or L, provided that at least one of the amino acid residues at 
positions Xaa41, Xaa42, Xaa45, or Xaa46 is not the same as the amino acid at the 
5 corresponding position in SEQ ID NO: 1 . 

In another embodiment, the framework region (FR3) is defined by amino acid 
residues 56-87 of SEQ ID NO: 5, wherein Xaa57 is F or I, Xaa69 is S or D, Xaa71 is S or 
T, Xaa73 is I or T, Xaa77 is M or L, Xaa79 is A or P, Xaa82 is A or F, or Xaa84 is T or 
V, provided that at least one of the amino acid residues at positions Xaa57, Xaa69, 
10 Xaa71, Xaa73, Xaa77, Xaa79, Xaa82, or Xaa84 is not the same as the amino acid at the 
corresponding position in SEQ ID NO: 1 . 

In another aspect, the invention provides a recombinant antibody that specifically 
binds EpCAM, wherein the antibody comprises an amino acid sequence, a portion of 
which defines a framework region in an immunoglobulin V L domain. In one 
15 embodiment, the framework region (FR1) is defined by amino acid residues 1-25 of SEQ 
ID NO: 6, wherein Xaa2 is I or V, Xaa9 is P or A, Xaal 1 is L or V, or Xaal 7 is T or S, 
provided that at least one of the amino acid residues at positions Xaa2, Xaa9, Xaal 1 or 
Xaal 7 is not the same as the amino acid at the corresponding position in SEQ ID NO: 2. 

In another embodiment, the framework region (FR2) is defined by amino acid 
20 residues 36-49 of SEQ ID NO: 6, wherein Xaa38 is K or R, Xaa40 is T or A, or Xaa46 is 
K or E, provided that at least one of the amino acid residues at positions Xaa38, Xaa40, 
Xaa46 is not the same as the amino acid at the corresponding position in SEQ ID NO: 2. 

In another embodiment, the framework region (FR3) is defined by amino acid 
residues 67-98 of SEQ ID NO: 6, wherein Xaa68 is F or V, Xaa69 is A or T, Xaa70 is F 
25 or I, Xaa73 is E or D, Xaa76 is A or T, Xaa80 is F or Y, Xaa83 is I or L, Xaa84 is N or S, 
Xaa85 is N or S, Xaa88 is N, A or S, Xaa91 is M or T, or Xaa93 is T or V, provided that 
at least one of the amino acid residues at positions Xaa68, Xaa69, Xaa70, Xaa73, Xaa76, 
Xaa80, Xaa83, Xaa84, Xaa85, Xaa88, Xaa91 or Xaa93 is not the same as the amino acid 
at the corresponding position in SEQ ID NO: 2. In another embodiment, the framework 
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region (FR4) is defined by amino acid residues 106-1 16 of SEQ ID NO: 6, wherein 
Xaal08 isQorT. 

In another embodiment, the immunoglobulin V L domain comprises an FR1 
sequence selected from the group consisting of: (i) amino acid residues 1-23 of SEQ ID 
NO: 9; and (ii) amino acid residues 1-23 of SEQ ED NO: 8. In another embodiment, the 
immonoglobulin V H domains comprises an FR sequence defined by amino acid residues 
1-25 of SEQ ID NO: 18 and or an FR sequence defined by amino acid residues 67-98 of 
SEQ ID NO: 18. More preferably, the V L domain comprises an amino acid sequence 
defined by amino acids 1-106 of SEQ ID NO: 9 and/or the V H domain comprises an 
amino acid sequence defined by amino acids 1-116 of SEQ ID NO: 18. 

Furthermore, the antibody optionally may include an amino acid sequence 
defining at least a portion of a CDR sequence including, for example, (i) amino acid 
residues 24-3 1 of SEQ ID NO: 1 ; (ii) amino acid residues 49-55 of SEQ ID NO: 1 ; 
and/or (iii) amino acid residues 88-96 of SEQ ID NO: 1. Similarly, the antibody 
1 5 optionally may include an amino acid sequence defining at least a portion of a CDR 
sequence including, for example, (i) amino acid residues 26-35 of SEQ ID NO: 2; (ii) 
amino acid residues 50-62 of SEQ ID NO: 2; and/or iii) amino acid residues 101-105 of 
SEQ ID NO: 2. 

In another embodiment, the antibody comprises the antigen targeting portion of an 
20 antibody-cytokine fusion protein. The cytokine preferably is an interleukin and more 
preferably is interleukin-2. 

In another aspect, the invention provides an expression vector encoding at least a 
portion of the antibody of the invention. In a preferred embodiment, the expression vector 
comprises the nucleotide sequence set forth in SEQ ID NO: 40. 

25 In another aspect, the invention provides a method of diagnosing, prognosing 

and/or treating a human patient having a disease associated with over-expression of 
EpCAM (for example, a disease in which EpCAM is present at a higher level in diseased 
tissue relative to tissue without that disease). The method comprises administering one of 
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the antibodies of the invention to an individual in need of such diagnosis, prognosis or 
treatment. 

The antibody optionally includes a diagnostic and/or therapeutic agent attached 
thereto. The agent may be fused to the antibody to produce a fusion protein. 
5 Alternatively, the agent may be chemically coupled to the antibody to produce an 

immuno-conjugate. It is contemplated that the agent may include, for example, a toxin, 
radiolabel, cytokine, imaging agent or the like. In a preferred embodiment, the antibody 
of the invention is fused as a fusion protein to a cytokine. Preferred cytokines preferably 
include interleukins such as interleukin-2 (IL-2), IL-4, IL-5, IL-6, IL-7, IL-10, IL-12, IL- 

10 13, IL-14, IL-15, IL-16 and IL-18, hematopoietic factors such as granulocyte- 
macrophage colony stimulating factor (GM-CSF), granulocyte colony stimulating factor 
(G-CSF) and erythropoeitin, tumor necrosis factors (TNF) such as TNFa, lymphokines 
such as lymphotoxin, regulators of metabolic processes such as leptin, interferons such as 
interferon a, interferon p, and interferon y, and chemokines. Preferably, the antibody- 

15 cytokine fusion protein displays cytokine biological activity. 



DESCRIPTION OF THE DRAWINGS 

Figures 1A, IB and 1C show an alignment of light and heavy chain variants and 
consensus sequences of KS antibodies. The immunoglobulin Framework Regions (FR1- 
20 FR4) are denoted by The immunoglobulin Complementarity Determining Regions 
(CDR1-CDR3) are denoted by *. Individual KS antibody light chain V region segments 
are referred to as "VK," wherein K refers to the fact that the light chain is a kappa chain. 
Individual KS antibody heavy chain V region segments are referred to as "Vh." 
Substitutable amino acids are denoted by "X" in the consensus sequences. 



25 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides recombinant antibodies that specifically bind 
human Epithelial Cell Adhesion Molecule (EpCAM). Preferred antibodies of the 
30 invention have altered variable regions that result in reduced immunogenicity in humans. 



WO 02/090566 PCT/US02/13844 

8 

Antibody variable regions of the invention are particularly useful to target antibodies and 
antibody fusion proteins to tumor tissues that over-express EpCAM in human patients. In 
preferred embodiments, an antibody of the invention is fused to a cytokine to produce an 
immuno-cytokine. 

5 Protein sequences of the invention 

The present invention discloses a family of antibody variable region or V region 
sequences that, when appropriately heterodimerized, bind to human epithelial cell 
adhesion molecule (EpCAM) also known as KS antigen or KSA. Preferred proteins of 
the invention are useful for treating human patients as described herein. Accordingly, 

10 preferred KS antibody variants are humanized, deimmunized, or both, in order to reduce 
their immunogenicity when administered to a human. According to the invention, murine 
KS antibodies can be deimmunized or humanized, for example, by using deimmunization 
methods in which potential T cell epitopes are eliminated or weakened by introduction of 
mutations that reduce binding of a peptide epitope to an MHC Class II molecule (see, for 

1 5 example W098/52976, and WO00/343 1 7), or by using methods in which non-human T 
cell epitopes are mutated so that they correspond to human self epitopes that are present 
in human antibodies (see, for example, U.S. Patent No. 5,712,120). 



L Variable Light Chain 

20 

The recombinant anti-EpCAM antibody has an immunoglobulin variable light 
chain sequence having the following amino acid sequence: 



X-I-X-L-T-Q-S-P-A-X-X-X-X-S-P-G-X-X-X-T-X-T-C- S-A-S-S-S-V-S-T-X-L-W-Y-X- 
25 Q-K-P-G-X-X-P-K-X-X-I-X-D^ 

X-I-X-S-X-E-X-E-D-X-A-X-Y-Y-C-H-Q-R-S-G-Y-P-Y-T-F-G-G-G-T-K-X-E-I-K 
(SEQ ID NO: 3). 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
30 acid sequence defining an immunoglobulin light chain FR1, which is represented by 

residues 1 to 23 of SEQ ID NO: 3, namely, X-I-X-L-T-Q-S-P-A-X-X-X-X-S-P-G-X-X- 
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X-T-X-T-C. More particularly, the recombinant anti-EpCAM antibody has at least one 
of the following amino acids in the FR1 region: Q or E at position Xaal ; L or V at 
position Xaa3; I, T or S at position XaalO; M or L at position Xaal 1 ; S or A at position 
Xaal 2; A, L or V at position Xaal 3; E or Q at position Xaal 7, K or R at position Xaal 8, 
5 V or A at position Xaal 9; and, M, L or I at position Xaa21 . More preferably, the 
recombinant anti-EpCAM antibody has at least one of the following amino acid 
substitutions in the FR1 region: E at position Xaal ; V at position Xaa3; T or S at 
position XaalO; L at position Xaal 1; A at position Xaal 2; L or V at position Xaal 3; Q at 
position Xaal 7, R at position Xaal 8, A at position Xaal 9; and, L or I at position Xaa21. 

10 In another embodiment, the recombinant anti-EpCAM antibody of the invention 

has an amino acid sequence defining an immunoglobulin light chain CDR1, which is 
represented by residues 24 to 33 of SEQ ID NO: 3, namely S-A-S-S-S-V-S-T-X-L. More 
particularly, the recombinant anti-EpCAM antibody of the invention has one of the 
following amino acids in the CDR1 region: M or I at position Xaa32. More preferably, 

15 the recombinant anti-EpCAM antibody has an amino acid substitution in the CDR1 
region, for example, I at position Xaa32. 

In another embodiment, the recombinant anti-EpCAM antibody has an amino acid 
sequence defining an immunoglobulin light chain FR2, which is represented by residues 
34 to 48 of SEQ ID NO: 3, namely W-Y-X-Q-K-P-G-X-X-P-K-X-X-I-X. More 

20 particularly, the recombinant anti-EpCAM antibody has at least one of the following 
amino acids in the FR2 region: Q or L at position Xaa36; S or Q at position Xaa41 ; S, A 
or P at position Xaa42; P or L at position Xaa45; W or L at position Xaa46; and, F or Y 
at position Xaa48. More preferably, the recombinant anti-EpCAM antibody has at least 
one of the following amino acid substitutions in the FR2 region: L at position Xaa36; Q 

25 at position Xaa41 ; A or P at position Xaa42; L at position Xaa45; L at position Xaa46; 
and, Y at position Xaa48. 

In another embodiment, the recombinant anti-EpCAM antibody has an amino acid 
sequence defining an immunoglobulin light chain FR3, which is represented by residues 
56 to 87 of SEQ ID NO: 3, namely, G-X-P-X-R-F-S-G-S-G-S-G-T-X-Y-X-L-X-I-X-S- 

30 X-E-X-E-D-X-A-X-Y-Y-C. More particularly, the recombinant anti-EpCAM antibody 
has at least one of the following amino acids in the FR3 region: F or I at position Xaa57; 
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A or S at position Xaa59; S, D or T at position Xaa69; I or T at position Xaa71 ; I or T at 
position Xaa73; S or N at position Xaa75; M or L at position Xaa77; A or P at position 
Xaa79; A or F at position Xaa82; and, T or V at position Xaa84. More preferably, the 
recombinant anti-EpCAM antibody has at least one of the following amino acid 
5 substitution in the FR3 region: I at position Xaa57; S at position Xaa59; D or T at 
position Xaa69; T at position Xaa71; T at position Xaa73; N at position Xaa75; L at 
position Xaa77; P at position Xaa79; F at position Xaa82; and, V at position Xaa84. 

In another embodiment, the recombinant anti-EpCAM antibody has an amino acid 
sequence defining an immunoglobulin light chain FR4, which is represented by residues 
10 97 to 106 of SEQ ID NO: 3, namely, F-G-G-G-T-K-X-E-I-K. More particularly, the 
recombinant anti-EpCAM antibody of the invention has at least one of the following 
amino acids in the FR4 region, for example, L or V at position Xaal03. Accordingly, the 
recombinant anti-EpCAM antibody of the invention has an amino acid substitution in the 
FR4 region, for example, V at position Xaal03. 

15 

II. Variable Heavy Chain 

The recombinant anti-EpCAM antibody has an immunoglobulin variable heavy 
chain sequence having the following amino acid sequence: 

20 

Q_X-Q-L-V-Q-S-G-X-E-X-K-^ 
W-V-X-Q-X-P-G-X-G-L-X-W-M-G-W^ 

X-X-X-X-T-S-X-S-T-X-X-L -QOC-X-X-L-R-X-E-D-X-A-X-Y-F-C-V-R-FOC-S-K-G-D- 
Y-W-G-X-G-T-X-V-T-V-S-S (SEQ ID NO: 4) 

25 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain FR1, which is represented by 
residues 1 to 25 of SEQ ID NO: 4, namely Q-X-Q-L-V-Q-S-G-X-E-X-K-K-P-G-X-X-V- 
K-I-S-C-K-A-S. More particularly, the recombinant anti-EpCAM antibody has at least 
30 one of the following amino acids in the FR1 region: I or V at position Xaa2; P or A at 
position Xaa9; L or V at position Xaal 1 ; E or S at position Xaal6; and, T or S at position 
Xaal7. More preferably, the recombinant anti-EpCAM antibody has at least one of the 
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following amino acid substitutions in the FR1 region: V at position Xaa2; A at position 
Xaa9; V at position Xaal 1; S at position Xaal6; and, S at position Xaal7. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain FR2, which is represented by 
5 residues 36 to 49 of SEQ ID NO: 4, W-V-X-Q-X-P-G-X-G-L-X-W-M-G. More 
particularly, the recombinant anti-EpCAM antibody has at least one of the following 
amino acids in the FR2 region: K or R at position Xaa38; T or A at position Xaa40; K or 
Q at position Xaa43; and, K or E at position Xaa46. More preferably, the recombinant 
anti-EpCAM antibody has at least one of the following amino acid substitutions in the 
10 FR2 region: R at position Xaa38; A at position Xaa40; Q at position Xaa43; and, E at 
position Xaa46. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain CDR2, which is represented by 
residues 50 to 66 of SEQ ID NO: 4, namely W-I-N-T-Y-T-G-E-P-T-Y-A-D-X-F-X-G. 
1 5 More particularly, the recombinant anti-EpCAM antibody has at least one of the 

following amino acids in the CDR2 region: D or K at position Xaa63; and, K or Q at 
position Xaa65. More preferably, the recombinant anti-EpCAM antibody has at least one 
of the following amino acid substitutions in the CDR2 region: K at position Xaa63; and, 
Q at position Xaa65. 

20 In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 

acid sequence defining an immunoglobulin heavy chain FR3, which is represented by 
residues 67 to 98 of SEQ ID NO: 4, namely R-X-X-X-X-X-X-T-S-X-S-T-X-X-L-Q-X- 
X-X-L-R-X-E-D-X-A-X-Y-F-C-V-R. More particularly, the recombinant anti-EpCAM 
antibody of the invention has at least one of the following amino acids in the FR3 region: 

25 F or V at position Xaa68, A, T or V at position Xaa69; F or I at position Xaa70; S or T at 
position Xaa71 ; L or A at position Xaa72; E or D at position Xaa73; A or T at position 
Xaa76; A or L at position Xaa79; F or Y at position Xaa80; I or L at position Xaa83; N or 
S at position Xaa84; N or S at position Xaa85; N, A or S at position Xaa88; M or T at 
position Xaa91; and, T or V at position Xaa93. More preferably, the recombinant anti- 

30 EpCAM antibody has at least one of the following amino acid substitutions in the FR3 
region: V at position Xaa68, T or V at position Xaa69; I at position Xaa70; T at position 
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Xaa71 ; A at position Xaa72; D at position Xaa73; T at position Xaa76; L at position 
Xaa79; Y at position Xaa80; L at position Xaa83; S at position Xaa84; S at position 
Xaa85; A or S at position Xaa88; T at position Xaa9l; and, V at position Xaa93. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
5 acid sequence defining an immunoglobulin heavy chain CDR3, which is represented by 
residues 99 to 105 of SEQ ID NO: 4, namely F-X-S-K-G-D-Y. More particularly, the 
recombinant anti-EpCAM antibody has at least one of the following amino acids in the 
CDR3 region, for example, I or M at position XaalOO. More preferably, the recombinant 
anti-EpCAM antibody has an amino acid substitution in the CDR3 region, for example, 

10 Mat position XaalOO. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain FR4, which is represented by 
residues 106 to 1 16 of SEQ ID NO: 4, namely W-G-X-G-T-X-V-T-V-S-S. More 
particularly, the recombinant anti-EpCAM antibody has at least one of the following 

1 5 amino acids in the FR4 region: Q or T at position Xaal 08; and, S or T at position XI 1 1. 
More preferably, the recombinant anti-EpCAM antibody has at least one of the following 
amino acid substitutions in the FR4 region: T at position Xaal 08; and, T at position 
Xlll. 

20 III. Refined Variable Lifiht Chain 

In another embodiment, the recombinant anti-EpCAM antibody has an 
immunoglobulin variable light chain sequence having the following amino acid sequence: 

25 X-I-X-L-T-Q-S-P-AO«C-SO(-S-P-G-E^ 
Q-K^-G-X-X-P-K-X-X-I^-D-T-S-N-L-^^ 

X-I-S-S-X-E-X-E-D-X-A-X-Y-Y-C -H-Q-R-S -G-Y-P-Y-T-F-G-G-G-T-K-L -E-I-K (SEQ 
ID NO: 5) 

30 In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 

acid sequence defining an immunoglobulin light chain FR1, which is represented by 
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residues 1 to 23 of SEQ ID NO: 5, namely X-I-X-L-T-Q-S-P-A-X-X-S-X-S-P-G-E-X-V- 
T-X-T-C. More particularly, the recombinant anti-EpCAM antibody has at least one of 
the following amino acids in the FR1 region: Q or E at position Xaal ; L or V at position 
Xaa3; I or T at position XaalO; M or L at position Xaal 1; A or L at position Xaal3; K or 
5 R at position Xaal 8; and, M or L at position Xaa21 . More preferably, the recombinant 
anti-EpCAM antibody has at least one of the following amino acid substitutions in the 
FR1 region: E at position Xaal; V at position Xaa3; T at position XaalO; L at position 
Xaal 1 ; L at position Xaal 3; R at position Xaal 8; and, L at position Xaa2 1 . 

In another preferred embodiment, the recombinant anti-EpCAM antibody has an 

10 amino acid sequence defining an immunoglobulin light FR1 having at least one of the 
following amino acids in the FR1 region: Q or E at position Xaal; A or L at position 
Xaal 1; and, M or L at position Xaa21. More preferably, the recombinant anti-EpCAM 
antibody has an amino acid sequence defining an immunoglobulin light FR1 having at 
least one of the following substitutions in the FR1 region: E at position Xaal ; L at 

1 5 position Xaal 1 ; and, L at position Xaa2 1 . 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin light chain FR2, which is represented by 
residues 34 to 48 of SEQ ID NO: 5, namely W-Y<^Q-K-P-G-XOC-P-K-X-X-I-F. More 
preferably, the recombinant anti-EpCAM antibody has at least one of the following 

20 amino acids in the FR2 region: S or Q at position Xaa41; S or A at position Xaa42; P or 
L at position Xaa45; and, W or L at position Xaa46. More preferably, the recombinant 
anti-EpCAM antibody has at least one of the following amino acid substitutions in the 
FR2 region: Q at position Xaa41 ; A at position Xaa42; L at position Xaa45; and, L at 
position Xaa46. 

25 In another preferred embodiment, the recombinant anti-EpCAM antibody of the 

invention has an amino acid sequence defining an immunoglobulin light FR2 having at 
least one of the following amino acids in the FR2 region: S or A at position Xaa42; P or 
L at position Xaa45; and, W or L at position Xaa46. More preferably, the recombinant 
anti-EpCAM antibody has an amino acid sequence defining an immunoglobulin light 

30 FR2 having at least one of the following substitutions in the FR2 region: A at position 
Xaa42; L at position Xaa45; and, L at position Xaa46. 
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In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin light chain FR3, which is represented by 
residues 56 to 87 of SEQ ID NO: 5, namely G-X-P-A-R-F-S-G-S-G-S-G-T-X-Y-X-L-X- 
I-S-S-X-E-X-E-D-X-A-X-Y-Y-C More particularly, the recombinant anti-EpCAM 
5 antibody has at least one of the following amino acids in the FR3 region: F or I at 
position Xaa57; S or D at position Xaa69; S or T at position Xaa71 ; I or T at position 
Xaa73; M or L at position Xaa77; A or P at position Xaa79; A or F at position Xaa82; 
and, T or V at position Xaa84. More preferably, the recombinant anti-EpCAM antibody 
has at least one of the following amino acid substitution in the FR3 region: I at position 

1 0 Xaa57; D at position Xaa69; T at position Xaa71 ; T at position Xaa73; L at position 
Xaa77; P at position Xaa79; F at position Xaa82; and, V at position Xaa84. 

In another preferred embodiment, the recombinant anti-EpCAM antibody of the 
invention has an amino acid sequence defining an immunoglobulin light FR3 having at 
least one of the following amino acids in the FR3 region: F or I at position Xaa57; S or D 

1 5 at position Xaa69; A or P at position Xaa79; A or F at position Xaa82; and, T or V at 
position Xaa84. More preferably, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin light FR3 having at least one of the following 
substitutions in the FR3 region: I at position Xaa57; D at position Xaa69; P at position 
Xaa79; F at position Xaa82; and, V at position Xaa84. 

20 

IV. Refined Variable Heavy Chain 

The recombinant anti-EpCAM antibody has an immunoglobulin variable heavy 
chain sequence having the following amino acid sequence: 

25 

Q-X-Q-L-V-Q-S-G-X-E-X-K-K^ 

W-VOt-Q-X-P-G-K-G-L-X-W-M-G- W-I-N-T-Y-T-G-E-P-T-Y-A-D-X-F-X-G-R-X-X- 

X-S-L-X-T-S-X-S-T-A-X-L-Q-X-X^ 

W-G-Q-G-T-S-V-T-V-S-S (SEQ ID NO: 6) 

30 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain FR1, which is represented by 
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residues 1 to 25 of SEQ ID NO: 6, namely Q-X-Q-L-V-Q-S-G-X-E-X-K-K-P-G-E-X-V. 
K-I-S-C-K-A-S. More preferably, the recombinant anti-EpCAM antibody has at least 
one of the following amino acids in the FR1 region: I or V at position Xaa2; P or A at 
position Xaa9; L or V at position Xaal 1 ; and, T or S at position Xaal7. Accordingly, a 
5 recombinant anti-EpCAM antibody of the invention has at least one of the following 
amino acid substitution in the FR1 region: V at position Xaa2; A at position Xaa9; V at 
position Xaal 1; and, S at position Xaal 7. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy FR1 having at least one of the 

10 following amino acids in the FR1 region: I or V at position Xaa2; P or A at position 

Xaa9; and, L or V at position Xaal 1 . Accordingly, a recombinant anti-EpCAM antibody 
of the invention has an amino acid sequence defining an immunoglobulin heavy FR1 
having at least one of the following substitutions in the FR1 region: V at position Xaa2; 
A at position Xaa9; and, V at position Xaal 1 . 

1 5 In another embodiment, a recombinant anti-EpC AM antibody of the invention has 

an amino acid sequence defining an immunoglobulin heavy chain FR2, which is 
represented by residues 36 to 49 of SEQ ID NO: 6, namely W-V-X-Q-X-P-G-K-G-L-X- 
W-M-G. More particularly, the recombinant anti-EpCAM antibody has at least one of 
the following amino acid substitution in the FR2 region: K or R at position Xaa38; T or 

20 A at position Xaa40; and, K or E at position Xaa46. More preferably, the recombinant 
anti-EpCAM antibody has at least one of the following amino acid substitution in the 
FR2 region: R at position Xaa38; A at position Xaa40; and, E at position Xaa46. 

In another preferred embodiment, the recombinant anti-EpCAM antibody has an 
amino acid sequence defining an immunoglobulin heavy FR2 having the following amino 

25 acids in the FR1 region, for example, K or E at position Xaa46. More preferably, the 
recombinant anti-EpCAM antibody has an amino acid sequence defining an 
immunoglobulin heavy FR2 having an amino acid substitution in the FR1 region, for 

example, E at position Xaa46. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
30 acid sequence defining an immunoglobulin heavy chain CDR2, which is represented by 
residues 50 to 66 of SEQ ID NO: 6, namely W-I-N-T-Y-T-G-E-P-T-Y-A-D-X-F-X-G. 
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More particularly, the recombinant anti-EpCAM antibody has at least one of the 
following amino acids in the CDR2 region: D or K at position Xaa63; and, K or Q at 
position Xaa65. More preferably, the recombinant anti-EpCAM antibody has at least one 
of the following amino acid substitutions in the CDR2 region: K at position Xaa63; and, 
5 Q at position Xaa65. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 
acid sequence defining an immunoglobulin heavy chain FR3, which is represented by 
residues 67 to 98 of SEQ ID NO: 6, namely R-X-X-X-S-L-X-T-S-X-S-T-A-X-L-Q-X-X- 
X-L-R-X-E-D-X-A-X-Y-F-C-V-R. More particularly, the recombinant anti-EpCAM 

10 antibody of the invention has at least one of the following amino acids in the FR3 region: 
F or V at position Xaa68; A or T at position Xaa69; F or I at position Xaa70; E or D at 
position Xaa73; A or T at position Xaa76; F or Y at position Xaa80; I or L at position 
Xaa83; N or S at position Xaa84; N or S at position Xaa85; N, A or S at position Xaa88; 
M or T at position Xaa91 ; and, T or V at position Xaa93. More preferably, the 

1 5 recombinant anti-EpCAM antibody has at least one of the following amino acid 

substitutions in the FR3 region: V at position Xaa68; T at position Xaa69; I at position 
Xaa70; D at position Xaa73; T at position Xaa76; Y at position Xaa80; L at position 
Xaa83; S at position Xaa84; S at position Xaa85; A or S at position Xaa88; T at position 
Xaa91 ; and, V at position Xaa93. 

20 In another preferred embodiment, the recombinant anti-EpCAM antibody of the 

invention has an amino acid sequence defining an immunoglobulin heavy chain FR3 
having at least one of the following amino acids in the FR3 region: F or V at position 
Xaa68; E or D at position Xaa73; N or S at position Xaa84; N or S at position Xaa85; N 
or A at position Xaa88; and, T or V at position Xaa93. More preferrably, the 

25 recombinant anti-EpCAM antibody has an amino acid sequence defining an 

immunoglobulin heavy FR3 having at least one of the following substitutions in the FR3 
region: V at position Xaa68; D at position Xaa73; S at position Xaa84; S at position 
Xaa85; A at position Xaa88; and, V at position Xaa93. 

In a preferred embodiment, the recombinant anti-EpCAM antibody has an amino 

30 acid sequence defining an immunoglobulin heavy chain FR4, which is represented by 
residues 106 to 1 16 of SEQ ID NO: 6, namely W-G-X-G-T-S-V-T-V-S-S. More 
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particularly, the recombinant anti-EpCAM antibody has at least one of the following 
amino acids in the FR4 region, for example, Q or T at position Xaal08. More preferably, 
the recombinant anti-EpCAM antibody has an amino acid substitution in the FR4 region, 
for example, T at position Xaal08. 
5 Accordingly, preferred V regions contain substitutions in FR domains of V H 

and/or VK regions corresponding to murine KS-1/4 variable regions. In addition, 
preferred V regions of the invention do not include insertions or deletions of amino acids 

* 

relative to the murine KS-1/4 variable regions. 

Preferred variants include proteins having variable regions with greater than 80% 

10 identity/homology murine KS-1/4. The amino acid sequence of murine KS variable 
region or a portion thereof may be used as a reference sequence to determine whether a 
candidate sequence possesses sufficient amino acid similarity to have a reasonable 
expectation of success in the methods of the present invention. Preferably, variant 
sequences are at least 70% similar or 60% identical, more preferably at least 75% similar 

1 5 or 65% identical, and most preferably 80% similar or 70% identical to a murine KS 
variable heavy or light chain FR or CDR. 

To determine whether a candidate peptide region has the requisite percentage 
similarity or identity to a murine KS sequence, the candidate amino acid sequence and 
murine KS sequence are first aligned using the dynamic programming algorithm 

20 described in Smith and Waterman (1981) J. Mol. Biol. 147:195-197, in combination with 
the BLOSUM62 substitution matrix described in Figure 2 of Henikoff and Henikoff 
(1992) PNAS 89:10915-10919. For the present invention, an appropriate value for the 
gap insertion penalty is -12, and an appropriate value for the gap extension penalty is -4. 
Computer programs performing alignments using the algorithm of Smith- Waterman and 

25 the BLOSUM62 matrix, such as the GCG program suite (Oxford Molecular Group, 

Oxford, England), are commercially available and widely used by those skilled in the art. 
Once the alignment between the candidate and reference sequence is made, a percent 
similarity score may be calculated. The individual amino acids of each sequence are 
compared sequentially according to their similarity to each other. If the value in the 

30 BLOSUM62 matrix corresponding to the two aligned amino acids is zero or a negative 
number, the pairwise similarity score is zero; otherwise the pairwise similarity score is 
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1.0. The raw similarity score is the sum of the pairwise similarity scores of the aligned 
amino acids. The raw score is then normalized by dividing it by the number of amino 
acids in the smaller of the candidate or reference sequences. The normalized raw score is 
the percent similarity. Alternatively, to calculate a percent identity, the aligned amino 

5 acids of each sequence are again compared sequentially. If the amino acids are non- 
identical, the pairwise identity score is zero; otherwise the pairwise identity score is 1.0. 
The raw identity score is the sum of the identical aligned amino acids. The raw score is 
then normalized by dividing it by the number of amino acids in the smaller of the 
candidate or reference sequences. The normalized raw score is the percent identity. 

1 0 Insertions and deletions are ignored for the purposes of calculating percent similarity and 
identity. Accordingly, gap penalties are not used in this calculation, although they are 
used in the initial alignment. 

The invention also discloses methods for assaying the expression of KS 
1 5 antibodies from cells such as mammalian cells, insect cells, plant cells, yeast cells, other 
eukaryotic cells or prokaryotic cells (see Example 1). In a preferred method, KS 
antibody V regions are expressed as components of an intact human antibody, and the 
expression of the antibody from a eukaryotic cell line assayed by an ELISA that detects 
the human Fc region. To precisely quantify binding of a KS antibody to EpCAM, a 
20 Biacore assay may be used. 

Treatment of human disease with KS antibody fusion proteins 

The invention also discloses the sequences of KS antibody-IL2 fusion proteins 
that are useful in treating human disease, such as cancer. Certain KS antibody-IL2 fusion 
25 proteins, such as KS-1/4-IL2 (see, for example, Construct 3 in Example X), may be used 
to treat human patients with cancer, with surprisingly little immune response against the 
antibody. 

It is found that, during treatment of human cancers with KS-1/4(VH2A^K1)-IL2, 
even less immunogenicity is seen than with KS-l/4(Construct 3)-IL2. Specifically, 
30 during a clinical trial, patients with anti-idiotypic antibodies and antibody directed against 
the antibody-IL2 junction or against the IL-2 moiety are seen at an even lower frequency 
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than with KS-l/4(Construct 3)-IL2. Antibody variable regions of the invention can also 
be fused to other cytokines, for example, interleukins 1, 2, 6, 10, or 12; interferons alpha 
and beta; TNF, and INF gamma. The invention may be more fully understood by 
reference to the following non-limiting examples 

5 

EXAMPLES 

Example L Methods and reagents for expressing KS antibodies and assaying their 
antigen-binding activity 

10 1A, Cell culture and transfection 

The following general techniques were used in the subsequent Examples. For 
transient transfection, plasmid DNA was introduced into human kidney 293 cells by co- 
precipitation of plasmid DNA with calcium phosphate [Sambrook et al (1989) Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor, NY]. 

1 5 In order to obtain stably transfected clones, plasmid DNA was introduced into the 

mouse myeloma NS/0 cells by electroporation. NS/0 cells were grown in Dulbecco's 
modified Eagle's medium supplemented with 10% fetal bovine serum. About 5xl0 6 cells 
were washed once with PBS and resuspended in 0.5 ml phosphate buffer solution (PBS). 
Ten ng of linearized plasmid DNA was then incubated with the cells in a Gene Pulser 

20 Cuvette (0.4 cm electrode gap, BioRad) for 10 minutes on ice. Electroporation was 

performed using a Gene Pulser (BioRad) with settings at 0.25 V and 500 /iF. Cells were 
allowed to recover for 10 minutes on ice, after which they were resuspended in growth 
medium and then plated onto two 96-well plates. Stably transfected clones were selected 
by growth in the presence of 100 nM methotrexate (MTX), which was introduced two 

25 days post-transfection. The cells were fed every 3 days for two to three more times, and 
MTX-resistant clones appeared in 2 to 3 weeks. Supernatants from clones were assayed 
by anti-human Fc EL1SA to identify high producers [Gillies et al (1989) J. Immunol. 
Methods 125:191]. High producing clones were isolated and propagated in growth 
medium containing 100 nM MTX. 
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IB. ELISAs 

Three different ELISAs were used to determine the concentrations of protein 
products in the supernatants of MTX-resistant clones and other test samples. The anti- 
huFc ELISA was used to measure the amount of human Fc-containing proteins, e.g., 
5 chimeric antibodies. The anti-hu kappa ELISA was used to measure the amount of kappa 
light chain (of chimeric or human immunoglobulins). The anti-muFc ELISA was used to 
measure the amount of muFc-containing proteins in test samples (see Example 1C 
below). 

The anti-huFc ELISA is described in detail below. 
10 A. Coating plates 

ELISA plates were coated with AffiniPure goat anti-human IgG (H+L) (Jackson 
Immuno Research) at 5 /ig/ml in PBS, and 100 /xl/well in 96-well plates (Nunc-Immuno 
plate Maxisorp). Coated plates were covered and incubated at 4°C overnight. Plates 
were then washed 4 times with 0.05% Tween (Tween 20) in PBS, and blocked with 1% 
1 5 BSA/1 % goat serum in PBS, 200 /il/well. After incubation with the blocking buffer at 
37°C for 2 hours, the plates were washed 4 times with 0.05% Tween in PBS and tapped 
dry on paper towels. 

B. Incubation with test samples and secondary antibody 

Test samples were diluted to the proper concentrations in sample buffer, which 
20 contained 1% BSA/1% goat serum/0.05% Tween in PBS. A standard curve was prepared 
with a chimeric antibody (with a human Fc), the concentration of which was known. To 
prepare a standard curve, serial dilutions are made in the sample buffer to give a standard 
curve ranging from 125 ng/ml to 3.9 ng/ml. The diluted samples and standards were 
added to the plate, 100 /-il/well, and the plate incubated at 37°C for 2 hours. 

25 After incubation, the plate was washed 8 times with 0.05% Tween in PBS. To 

each well was then added 100 /xl of the secondary antibody, the horse radish peroxidase 
(HRP) -conjugated anti -human TgG (Jackson Immuno Research), diluted around 
1:120,000 in the sample buffer. The exact dilution of the secondary antibody had to be 
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determined for each lot of the HRP-conjugated anti-human IgG. After incubation at 
37°C for 2 hours, the plate was washed 8 times with 0.05% Tween in PBS. 

C. Development 

The substrate solution was added to the plate at 100 jil/well. The substrate 
5 solution was prepared by dissolving 30 mg of o-phenylenediamine dihydrochloride 

(OPD) (1 tablet) into 15 ml of 0.025 M citric acid/0.05M Na2HP04 buffer, pH 5, which 
contained 0.03% of freshly added H2O2. The color was allowed to develop for 30 
minutes at room temperature in the dark. The developing time was subject to change, 
depending on lot to lot variability of the coated plates, the secondary antibody, etc. The 
10 color development in the standard curve was observed to determine when to stop the 
reaction. The reaction was stopped by adding 4N H2S04, 100 /^1/well. The plate was 

read by a plate reader, which was set at both 490 nm and 650 nm and programmed to 
subtract off the background OD at 650 nm from the OD at 490 nm. 

The anti-hu kappa ELISA followed the same procedure as described above, 
1 5 except that the secondary antibody used was horse radish peroxidase-conjugated goat 
anti-hu kappa (Southern Biotechnology Assoc. Inc., Birmingham, AL), used at 1 :4000 
dilution. 

The procedure for the anti-muFc ELISA was also similar, except that ELISA 
plates were coated with AffiniPure goat anti-murine IgG (H+L) (Jackson Immuno 
20 Research) at 5 /xg/ml in PBS, and 100 /il/well; and the secondary antibody was horse 

radish peroxidase-conjugated goat anti-muIgG, Fc7 (Jackson ImmunoResearch), used at 
1:5000 dilution. 

1C. Cloning of the KS antigen (KSA. EpCAM) and expression of the soluble 
form as human EpCAM-murine Fc 

25 Messenger RNA (MRNA) was prepared from LnCAP cells using Dynabeads 

mRNA Direct Kit (Dynal, Inc., Lake Success, NY) according to the manufacturer's 
instructions. After first strand cDNA synthesis with oligo(dT) and reverse transcriptase, 
full length cDNA encoding epithelial cell adhesion molecule (also known as KS antigen 
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or KSA), was cloned by polymerase chain reaction (PCR). The sequences of the PCR 
primers were based on the published sequence described in Perez and Walker (1989) J. 
Immunol. 142:3662-3667. The sequence of the sense primer is 
TCTAGA GCAGCATGGCGCCCCCGCA (SEQ ID NO: 27), and the sequence of the 
5 nonsense primer is CTCGAG TTATGCATTGAGTTCCCT (SEQ ID NO: 28), where the 
translation initiation codon and the anti-codon of the translation stop codon are denoted 
in bold, and the restriction sites Xbal (TCTAGA) and Xhol (CTCGAG) are underlined. 
The PCR product was cloned and the correct KSA sequence was confirmed by 
sequencing several independent clones. The cDNA sequence of the KSA from LnCAP 
1 0 was essentially identical to the published sequence of KSA from UCLA-P3 cells (Perez 
and Walker, 1989). However, at amino acid residue number 115, the nucleotide 
sequence from LnCAP was ATG rather than ACG (Met instead of Thr), and at amino 
acid residue number 277, the nucleotide sequence from LnCAP was ATA rather than 
ATG (He instead of Met). 

1 5 Binding of KS-1/4 antibody to recombinant KSA was demonstrated by 

immunostaining. Surface expression of KSA was obtained by transfecting cells, e.g., 
CT26, B16, etc., with full length KSA in a suitable mammalian expression vector (pdCs, 
as described in U.S. Patent Number 5,541 ,087), followed by immunostaining with the 
KS-1/4 antibody. For the expression of KSA as a soluble antigen, the portion of the 

20 cDNA encoding the transmembrane domain of the KSA was deleted. To facilitate 

expression, detection, and purification, the soluble KSA was expressed as a KSA-muFc, 
the construction of which is described as follows. The 780 bp Xbal-EcoRI restriction 
fragment encoding the soluble KSA was ligated to the Aflll-Xhol fragment encoding the 
muFc (U.S. Patent Number 5,726,044) via a linker-adaptor: 

25 5 ' AA TTC TCA ATG CAG GGC 3' (SEQ ID NO: 29) 

3' G AGT TAC GTC CCG AAT T 5' (SEQ ID NO: 30) 



The Xbal-Xhol fragment encoding soluble KSA-muFc was ligated to the pdCs 
vector. The resultant expression vector, pdCs-KSA-muFc, was used to transfect cells and 
stable clones expressing KSA-muFc were identified by anti-muFc ELISA. 
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ID. Measurement of Antigen Binding 

KSA-muFc in conditioned medium was first purified by Protein A 
chromatography according to supplier's protocol (Repligen, Cambridge, MA). Purified 
KSA-muFc was used to coat 96-well plates (Nunc-Immuno plate, Maxisorp) at 5 fig/ml 
5 in PBS, and 100 /ilAvell. The assay was similar to the ELISA procedure described in 
Example IB. Briefly, coated plates were covered and incubated at 4°C overnight. Plates 
then were washed and blocked. Test samples were diluted to the proper concentrations in 
the sample buffer, added to the plate at 100 /il/well, and the plate was incubated at 37°C 
for 1 hour. After incubation, the plate was washed 8 times with 0.05% Tween in PBS. 
10 To each well was then added 100 fi\ of the secondary antibody, the horse radish 

peroxidase-conjugated anti-human IgG (Jackson Immuno Research), diluted around 
1 :1 20,000 in the sample buffer. The plate was then developed and read as described in 
Example IB. 

IE. Measurement of on-rates and off-rates of KS-1/4 antibodies from EpCAM 
using a Biacore assay. 

The affinity of KS-1/4 and KS-IL2 molecules for the antigen EpCAM were 
measured by surface plasmon resonance analysis of the antibody-antigen interaction, 
using a Biacore machine (Biacore International AB, Uppsala, Sweden). EpCAM- 
murineFc was coupled to a CM5 sensor chip using an amine coupling protocol supplied 
by the manufacturer. KS-1/4 and KS-IL2 at concentrations varying between 25 nm and 
200 nM were then passed over the chip, whereby binding to the chip was observed. Using 
the built-in curve-fitting routines of the Biacore software, the on-rate, off-rate, association 
and dissociation constants were calculated. 

IF. Measurement of binding affinities of KS-1/4 antibodies using cell lines 
25 expressing EpCAM 

Purified KS-1/4 antibodies were iodinated with 125 I using standard techniques, 
and increasing concentrations of labeled protein were incubated with the EpCAM - 
positive cell line PC-3. Saturation binding curves were generated and the dissociation 
constants were determined by Scatchard analysis. 
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Example 2. Cloning of cDNAs encoding Vh and Vk of mouse KS-1/4 and construction 
of vector for the expression of KS-1/4 hvbridoma-derived antibody 

Messenger RNA prepared from the mouse KS-l/4-expressing hybridoma 
(obtained from R. Reisfeld, Scripps Research Institute) was reverse transcribed with 
5 oligo(dT) and then used as templates for PCR to amplify the sequences encoding the 
variable region of the heavy chain (V H ) and the variable region of the light chain (V K ). 
The PCR primers were designed based on published sequences (Beavers et ai, ibid.)- 
The PCR primers for V H had the following sequences: 

V H forward primer (5*) GACTCGAGCCCAAGTCTTAGACATC (3') (SEQ ID NO: 

10 31) 

V H reverse primer (5') CAAGCTTACCTGAGGAGACGGTGACTGACGTTC (3'), 
(SEQ ID NO: 32) 

where the CTCGAG and AAGCTT sequences represent the Xhol and HindUI restriction 
sites, respectively, used for ligating the V H into the expression vector (see below); and the 
1 5 TAC in the reverse primer would introduce GTA, the splice donor consensus sequence, 
in the sense strand of the PCR product. 

The PCR primers for V K had the following sequences: 

V K forward primer (5') GATCTAGACAAGATGGATTTTCAAGTG (3') (SEQ ID 
NO: 33) 

20 V K reverse primer (5') GAAGATCTTACGTTTTATTTCCAGCTTGG (30 (SEQ 

ID NO: 34) 

where the TCTAGA and AGATCT sequences represent the Xbal and Bglll restriction 
sites, respectively, used for ligating the V K into the expression vector (see below); ATG 
is the translation initiation codon of the light chain; and the TAC in the reverse primer 
25 would introduce GTA, the splice donor consensus sequence, in the sense strand of the 
PCR product. 
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The PCR products encoding the Vh and V K of the mouse KS-1/4 antibody were 
cloned into pCRII vector (Invitrogen, Carlsbad, CA). Several Vh and V it clones were 
sequenced and the consensus sequence of each determined. The V H and Vj< sequences 
were inserted in a stepwise fashion into the expression vector pdHL7. The ligations took 
5 advantage of the unique Xhol and Hindlll sites for the V H , and the unique Xbal and 

Bglll/BamHI sites for the V K (the unique Bglll in the Vk. insert and the unique BamHI in 
the vector have compatible overhangs). The resultant construct is called pdHL7- 
hybridoma chKS-1/4, which already contained transcription regulatory elements and 
human Ig constant region sequences for the expression of chimeric antibodies (Gillies et 
10 al (1989) J. Immunol. Methods 125:191). 

The expression vector pdHL7 was derived from pdHL2 [Gillies et ai (1991) 
Hybridoma 10:347-356], with the following modifications: in the expression vector 
pdHL2, the transcriptional units for the light chain and the heavy chain-cytokine 
consisted of the enhancer of the heavy chain immunoglobulin gene and the 
15 metallothionein promoter. In pdHL7, these two transcriptional units consisted of the 
. CMV enhancer-promoter [Boshart et ai (1985) Cell 41 :521-530]. The DNA encoding 
the CMV enhancer-promoter was derived from the Afllll-Hindlll fragment of the 
commercially available pcDNAI (Invitrogen Corp., San Diego, CA). 

Example 3. Expression studies of murine KS-1/4 antibodies 

20 This example discusses expression studies performed using an antibody 

expression plasmid encoding the V region sequences disclosed in U.S. Patent No. 
4,975,369. 

3A. Plasmid Construction 

To directly compare the chimeric antibodies encoded by the Hybridoma KS-1/4 
25 sequence and those sequences described in U.S. Patent No. 4,975,369, the cDNA 

encoding the VH sequence described in U.S. Patent No. 4,975,369 was synthesized. This 
was then ligated into the pdHL7 expression vector already containing the V K of KS-1/4. 
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In order to construct the Vh sequence described in U.S. Patent No. 4,975,369, an 
NdcI-HindHl fragment encoding part of the V H sequence was obtained by total chemical 
synthesis. Overlapping oligonucleotides were chemically synthesized and ligated. The 
ligated duplex was then subcloned into a Xbal-Hindlll pBluescript vector (Stratagene, 
5 LaJolla, CA). 

This DNA encodes the protein sequence IQQPQNMRTM of U.S. Patent 
No. 4,975,369. Immediately V to the coding sequence is the splice donor site beginning 
with gta. The ctag at the 5' end of the top strand is the overhang for the Xbal cloning 
10 site. The Xbal site was created only for cloning into the polylinker of the pBluescript 

vector. It was followed immediately by the Ndel restriction site (CATATG). The agct at 
the 5' end of the bottom strand is the overhang of the Hindlll cloning site. This Hindlll 
sticky end is later ligated to the Hindlll site in the intron preceding the C7I gene [Gillies 
etal (1991)Hybridoma 10:347-356]. 

1 5 After sequence verification, the Ndel-Hindlll restriction fragment was 

isolated. This, together with the Xhol-Ndel fragment encoding the N-terminal half 
of V H , was then ligated to the Xhol-Hindlll digested pdHL7 expression vector 
containing the V K of KS-1/4. The resultant construct, pdHL7-*369 chKS-1/4, 
contained the V K and V H described in U.S. Patent No. 4,975,369 (referred to as 

20 US4,975,369 chKS-1/4). 

3B. Comparison of hybridoma chKS-1/4 and US4.975 369 chKS-1/4 antibodies 

The plasmid DNAs pdHL7-hybridoma chKS-1/4 and pdHL7- 5 369 chKS-1/4 were 
introduced in parallel into human kidney 293 cells by the calcium phosphate 
coprecipitation procedure mentioned above. Five days post-transfection, the conditioned 
25 media were assayed by anti-huFc ELISA and kappa ELISA (see Example 1 for ELISA 
procedures) and the results are summarized in Table 1. 
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Table 1 . 

Antibody huFc ELISA Kappa ELISA 

Hybridoma chKS-1/4 254 ng/mL 200 ng/mL 

5 US4,975,369 chKS-1/4 14 ng/mL 0 ng/mL 

The results indicated that hybridoma chKS-1/4 was expressed and secreted 
normally, and that the secreted antibody consisted of roughly equimolar amounts of 
heavy and light chains, within the accuracies of the two different ELISAs. On the other 
hand, only a low level of heavy chain was detected in the conditioned medium for the 
10 US4,975,369 chKS-1/4 antibody, and no kappa light chain was associated with it. 

Western blot analysis was performed on the total cell lysates and the conditioned 
media of the two transiently transfected cell lines. The procedures for Western blot 
analysis were as described in (Sambrook et al (1989), supra). In order to analyze the 
total cell lysates, the transfected cells were lysed, centrifuged to remove the debris, and 

15 the lysate from the equivalent of 5xl0 5 cells applied per lane. To analyze the conditioned 
media, the protein product from 300 fiL of the conditioned medium was first purified by 
Protein A Sepharose chromatography prior to SDS-PAGE under reducing conditions. 
After Western blot transfer, the blot was hybridized with a horseradish peroxidase- 
conjugated goat anti-human IgG, Fey (Jackson ImmunoResearch), used at 1:2000 

20 dilution. 

The Western blot transfer showed that under the conditions used, the heavy chain 
was detected in both the conditioned media and the lysed cells of the transfection with 
pdHL7-hybridoma chKS-1/4. This result indicates that the heavy chain of the chKS-1/4 
antibody was produced in the cells and secreted efficiently (together with the light chain). 
25 On the other hand, the heavy chain from the transfection with pdHL7-'369 chKS-1/4 was 
detected only in the cell lysate but not in the conditioned media. This result indicated 
that although a comparable level of heavy chain was produced inside the cell, it was not 
secreted. This finding was consistent with the ELISA data, which showed that there was 
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no kappa light chain associated with the small amount of secreted heavy chain in the 
US4,975,369 chKS-1/4 antibody. It is understood that immunoglobulin heavy chains 
typically are not normally secreted in the absence of immunoglobulin light chains 
[Hcndershot et al (1987) Immunology Today 8:11 1]. 

5 In addition to the foregoing, NS/0 cells were transfected by electroporation with 

the plasmids pdHL7-Hybridoma chKS-1/4 and pdHL7-US4,975,369 chKS-1/4 in 
parallel. Stable clones were selected in the presence of 1 00 nM MTX, as described in 
Example 1, and the conditioned media of the MTX-resistant clones in 96-well plates was 
assayed by anti-huFc ELIS A, as described in Example 1 . The results are summarized in 

10 Table 2. 

Table 2 



15 



Total number of Highest level 

Antibody clones screened Mode* of expression* 

Hybridoma chKS-1/4 80 0.1-0.5 /zg/mL (41) 10-50 pg/mL (4) 

US4,975,369 chKS-1/4 47 0-10 ng/mL (36) 0.1-0.4 /ig/mL (4) 

(*The numbers in parentheses denote the number of clones in the mode or the number 
20 expressing the highest levels of product, as determined by anti-Fc ELIS A.) 

When screened at the 96-well stage, the majority of the clones obtained with the 
pdHL7-hybridoma chKS-1/4 construct produced about 100 ng/mL to 500 ng/mL of 
antibody, with the best clones producing about 10-50 fig/mL. On the other hand, the 

25 majority of the clones obtained with the pdHL7-'369 chKS-1/4 construct produced about 
0 ng/mL to 10 ng/mL of antibody, with the best producing about 300-400 ng/mL. To 
examine the composition and binding properties of the US4,975,369 chKS-1/4 antibody, 
it was necessary to grow up the clones that produced at 300-400 ng/mL. Two of these 
clones were chosen for expansion. However, their expression levels were found to be 

30 very unstable. By the time the cultures were grown up to 200 mL, the expression levels 
of both clones had dropped to about 20 ng/mL, as assayed by anti-Fc ELIS A. When the 
same conditioned media were assayed by the anti-kappa ELISA, no kappa light chain was 
detected, as was the case in transient expression in 293 cells. 



WO 02/090566 PCT/US02/1 3844 

29 

The following experiment indicated that no detectable kappa light chain was 
associated with the US4,975,369 chKS-1/4 heavy chain. Briefly, 50 mL each of the 
conditioned media from each of the clones was concentrated by Protein A 
chromatography. The eluate were assayed by anti-Fc ELISA and anti-kappa ELISA. As 
5 a control, conditioned medium from a hybridoma chKS-l/4-producing clone was treated 
the same way and assayed at the same time. The ELISA results are summarized in Table 
3. 

Table 3 

10 Antibody huFc ELISA Kappa ELISA 

Hybridoma chKS- 1 /4 42 /ig/mL 44 jig/mL 

US4,975,369 chKS-l/4-clone 1 253 ng/mL 0 ng/mL 

US4,975,369 chKS-l/4-clone 2 313 ng/mL 0 ng/mL 

The results showed that there was indeed no detectable kappa light chain 
1 5 associated with the US4,975,369 chKS- 1 /4 heavy chain. Furthermore, the hybridoma 
chKS- 1/4 antibody was shown to bind KS antigen at 10-20 ng/mL, whereas the 
US4,975,369 antibody from both clones and concentrated to 253 and 313 ng/mL, still did 
not bind KS antigen (see Example 9 for measurement of binding to KS antigen.) 

Example 4. Expression and characterization of variant KS antibodies 
20 Mutations that significantly lower the expression or the affinity of an antibody for 

a target molecule are expected to be less effective for therapeutic purposes in humans. 
Some approaches to reducing immunogenicity, such as "veneering," "humanization," and 
"deimmunization" involve the introduction of many amino acid substitutions, and may 
disrupt binding of an antibody to an antigen (see, e.g., U.S. Patent Nos. 5,639,641 ; and 
25 5,585,089; and PCT Publication Nos. WO 98/52976; WO 00/343 1 7). There is a need in 
the art for classes of antibody sequences that will bind to epithelial cell adhesion 
molecule, but which are distinct from the original mouse monoclonal antibodies that 
recognize this antigen. 
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Various combinations of KS-1/4 heavy and light chain variable ("V") regions 
were tested for their ability to be expressed, and for their ability to bind to EpCAM. 
These results are summarized in Tables 4-6 and described below. 



5 Table 4. Sequences of KS-1/4 antibody heavy and light chain V regions. 
Light chains: 

10 20 30 40 50 60 

I I I I I I 

VK0 QILLTQSPAIMSASPGEKVTMTCSASSSVSYMLWYQQKPGSSPKPWIFDTSNLASGFPAR 

10 VK 1 QIVLTQS PA SLAV S PGQRATITC S AS S S VS Y I LW YQQKPGQP PKPW I FDTSN LASGFP SR 

VK6 EI VLTQS PATLSLS PGERVTLTC S AS S S VS YMLW YQQKPGQAPKLL I FDTSNLASGI PAR 

VK7 QILLTQSPAIMSASPGERVTMTCSASSSVSYMLWYQQKPGSSPKPWIFDTSNLASGFPAR 

VK8 EIVLTQSPATLSLSPGERVTLTCSASSSVSYMLWYQQKPGSSPKPWIFDTSNLASGFPAR 

15 70 80 90 100 

I I I 

VK0 FSGSGSGTSYSLIISSMEAEDAATYYCHQRSGYPYTFGC 

VK1 FSGSGSGTSYTLTINSLEAEDAATYYCHQRSGYPYTFGC 

VK6 FSGSGSGTDYTLTISSLEPEDFAVYYCHQRSGYPYTFGC 

20 VK7 FSGSGSGTSYSLIISSMEPEDAATYYCHQRSGYPYTFGC 

VK8 FSGSGSGTSYSLIISSMEAEDAATYYCHQRSGYPYTFGC 



(SEQ 


ID 


NO: 


1) 


(SEQ 


ID 


NO: 


11) 


(SEQ 


ID 


NO: 


7) 


(SEQ 


ID 


NO: 


8) 


(SEQ 


ID 


NO: 


9) 



Heavy chains: 

10 20 30 40 50 60 

25 | | | I I I 

VHO QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMNWVKQTPGKGLKWMGWINTYTGEPTY 

VH1 QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVRQAPGKGLKWMGWINTYTGEPTY 

VH2 QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVRQAPGKGLKWMGWINTYTGEPTY 

VH2 . 5 QIQLVQSGPELKKPGSSVKISCKASGYTFTNYGMNWVRQAPGKGLKWMGWINTYTGEPTY 

30 VH6 QVQLVQSGAEVKKJPGESVKISCKASGYTFTNYGMNWVRQ^GKGLEWMGWINTYTGEPTY 

VH 7 Q I QLVQSGAEVKKPGETVKI S CKASGYTFTNYGMNWVKQTPGKGLKWMGW I NT YTGE PTY 

VH369 QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMNWVKQTPGKGLKWMGWINTYTGEPTY 

70 80 90 100 110 

35 II III 

VHO ADDFKGRFAFSLETSASTAFLQINNLRNE . DMATYFCVRFISKGDYWGQGTSVTVSS (SEQ ID NO: 2} 

VH1 ADDFKGRF TITA ETSTSTLYLQLNNLRSE . DTATYFCVRFMSKGDYWGQGTTVTVSS (SEQ ID NO 

VH2 ADDFKGRF TITA ETSTSTLYLQLNNLRSE . DTATY FCVRFI SKGD YWGQGTTVTVS S (SEQ ID NO 

VH2.5 ADD F KGRFTITAET SS^ <3 TLYLQLNNLR SE . DTATY FCVR FI S KGD YWGTGTTVTVS S (SEQ ID NO 

40 VH6 AQKFQG RVTI SLDTSTSTAYLQ LSS LRAE . DTAVYFCVRFISKGDYWGQGTSVTVSS (SEQ ID NO 

VH7 ADDFKGRFAFSLETSTSTAFLQINNLRSE . DTATYFCVRFISKGDYWGQGTSVTVSS (SEQ ID NO 

VH369 ADDFKGRFAFSLETSASTAFLQI qqpqninrt MATYFCVRFISKGDYWGQGTSVTVSS (SEQ ID NO 



21) 
22) 

19) 
17) 
18) 
35) 
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Table 5. Sequences of KS-1/4 antibody variants and CDR3 heavy chain variants with 
single amino acid insertions. 



10 



VH2 partial seq. : 
92-109 Of SEQ ID NO 
VH2 . 1 
VH2 .2 
VH2 .3 
VH2 .4 



22) 



ATYFCVRF I S K GDYWGQG 

ATYFCVRF IIS K GDYWGQG 
ATYFCVRF IVS K GDYWGQG 
ATYFCVRF I SAK GDYWGQG 
ATYFCVRF I S KTGDYWGQG 



(amino acid residues 



(SEQ ID NO 

(SEQ ID NO 

(SEQ ID NO 

(SEQ ID NO 



36) 
37) 
38) 
39) 



Table 6. Expression levels and binding activity of variant KS-1/4 antibodies . 



Construct 


Expression 


EpCAM affinity 




Transient (*) 
(in ng/mL) 


Stable O 
(in ng/mL) 


Relative 
binding (**) 


Kd (nM) 


Group 1 










VK0/VH0 (Hybridoma chKS-1/4) 




10-50 


1x 


1.0 x 10"* 


VKO/VH'369 ('369 chKS-1/4) 




o.i -o.4r*> 


»30x 




VK8/VH7 (Construct 3) 




10-50 




1.0x10* 


VK6A/H6 (Construct 1) 


300 




n.d. 




VK7/VH7 (Construct 2) 


30 








VK8/VH7-IL2 




10-50 




1.0 x 10" 3 


VK1/VH1-IL2 




10-50 




7.9x10 y 


VK1/VH2-IL2 




10-50 




3.1 x 10* 












Group 2 










VK8/VH7 (Construct 3; control) 


1500 




1x 




VK07VH1 


1500 




8x 




VK1/VH7 


1500 




1x 




VK1/VH1 


1500 




2x 




VK1A/H2 


1500 




1x-2x 




VK1/VH1-IL2 


1500 




5x 




VK1/VH2-IL2 


1500 




1.5x 




VK1/VH2.5-IL2 


1500 




3x-4x 














Group 3 










VK8/VH7-IL2 (control) 


760 




1x 




VK1A/H1-IL2 


350 




2x 




VK1/VH2.1-IL2 


290 




>10x 




VK1/VH2.2-IL2 


270 




>10x 




VK1A/H2.3-IL2 


190 




7x 




VK1/VH2.4-IL2 


210 




3x 





(*) Routinely achievable levels. 

(**) "Relative Binding" is expressed as the fold-increase in protein concentration required to reach an 
equivalent level of binding. Thus, a larger number reflects a lower affinity for EpCAM. 
(***) Kappa light chain was not detectable by ELISA (equivalent to background); therefore, functional 
20 antibodies were not expressed. 
(****) n.d. = not detectable 

In Group 2 and Group 3, the relative binding activity of each protein was normalized to the control shown in 
the first line for that group. The ELISA assay is primarily a reflection of off-rates, based on amount of protein 
bound after several rounds of washes. It is used as a rapid screen to rule out poor binders, but is not a 
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precise measure of affinity, in Group 3, VH2 variants VH2.1 - VH2.4 were compared with VH1 to determine 
if amino acid insertions might result in improved relative binding. 

The sequences are related as follows. As described in the examples, the VHO and 

5 VKO sequences were derived from PCR amplification from a hybridoma cell line that 
expresses the original mouse-derived KS-1/4 (SEQ ID NO: 1 and SEQ ID NO: 2). VH- 
'369 is the VH sequence disclosed in U.S. Patent No. 4,975,369. Sequences VH1 , VH2, 
VH2. 1-2.4 VK1, and VK2 were derived either using deimmunization technology where 
potential T cell epitopes are eliminated or weakened by introduction of mutations that 

10 reduce binding of a peptide epitope to an MHC Class II molecule, or by changing non- 
human T cell epitopes so that they correspond to human self-epitopes that are present in 
human antibodies. The design of these constructs is ftirther described and analyzed 
below. Constructs of Table 6 were generated by transfecting mammalian cells with 
combinations of nucleic acids that expressed the corresponding heavy and light chain V 

1 5 regions. Sequences VH6, VH7, VK6, VK7, and VK8 were generated by changing 
surface residues of the hybridoma KS-1/4 to human counterparts as described below, 
with the purpose of removing potential human B cell epitopes. Constructs 1 through 3 
were generated by transfecting mammalian cells with combinations of nucleic acids that 
expressed heavy and light chain V regions VH6, VH7, VK6, VK7, and VK8 as described 

20 in Table 4 and below. 



4A. Characterization of KS antibodies with fewer human T cell epitopes 

Sequences VH2.1-VH2.5 were made to test whether certain amino acid insertions 
and substitutions in the region of the KS-1/4 heavy chain CDR3 could be tolerated. 
25 Expression vectors for the light and heavy chain combinations VK0/VH1 , VK1/VH7, 
VK1/VH1, VK1/VH2, VK1/VH1-IL2, VK1/VH2-IL2, and VK1/VH2.5-IL2 were 
constructed and the corresponding antibodies and antibody-IL2 fusion proteins expressed 
and tested according to methods described in the preceding examples. 

Specifically, sequences VHI, VH2, VK1, and VK2 were obtained by total 
30 chemical synthesis. For each of these sequences, a series of overlapping oligonucleotides 
that span the entire coding and complementary strands of these regions were chemically 
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synthesized, phosphorylated, and ligated. The ligated duplex molecules were then 
amplified by PCR with appropriate primers to the fragment ends, introduced into pCRII 
vector (Invitrogen, Carlsbad, CA) and the sequences verified. These DNA fragments 
were then introduced into the expression vector pdHL7 at appropriate sites to generate 
5 the complete heavy ("H") chain and light ("L") chain, respectively. 

Sequence VH2.5 was derived from VH2 by the modification of a single codon to 
obtain a Thr rather than a Gin at position 1 08 (Table 4), using standard molecular biology 
techniques. 

The antibodies were tested by ELISA (Table 6) and using surface plasmon 
10 resonance (Biacore machine and software) to compare their ability to bind to EpCAM. 
Results of the ELISA experiments were considered to reflect primarily off-rate and not 
on-rate, and to be generally less precise, such that a poor ELISA result was generally 
used to exclude certain constructs from further consideration. However, antibodies that 
showed good binding by the ELISA test needed to be characterized further. 

1 5 Results of the surface plasmon resonance analysis were as follows: 

Fusion Protein k on (M" 1 s' 1 ) koff (s" 1 ) Kp (M) 



VK8/VH7-IL2 
VK1/VH2-IL2 
20 VK1/VH1-IL2 



3.1 xlO 5 3.2 x 

1.7 xlO 5 5.3 x 

2.8 xlO 5 2.2 x 



10- 4 1.0 xlO" 9 
10-4 3 i x io-9 
10-3 7,9 x io-9 



Because the off-rate of VK1/VH1-IL2 was much faster than for VK1/V2-EL2 or 
VK8/VH7-IL2, VK1/VH1-IL2 was considered to be a less useful fusion protein. 



25 



Considering that VK1/VH1-IL2 and VK1/VH1 -IL2 differ only by the 
methionine/isoleucine difference at V H position 100 in CDR3, the enhanced off-rate of 
VK1/VH1-IL2 compared to VK1 /VH2-IL2 suggests that this position makes a 
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hydrophobic contact with EpCAM, and that the slightly longer methionine side-chain 
makes a less effective contact. In the field of protein-protein interactions, it is generally 
thought that hydrophobic interactions play a major role in determining off-rates but a 
much less significant role in determining on-rates. 

5 4B. Characterization of KS-1/4 variants with single amino acid insertions 

The importance of the CDR3 sequence in the heavy chain V region for the affinity 
of the KS antibody to EpCAM was determined with a series of variants that contained an 
amino acid insertion or substitution in this region. Sequences VH2.1, VH2.2, VH2.3, and 
VH2.4 were generated by manipulation of an expression vector encoding VH2 and VK1 
10 using standard recombinant DNA techniques. The resulting expression vectors were 
transfected into NS/0 cells and secreted antibody proteins purified as described in 
preceding examples. 

It was found that the VH1 variant was suboptimal compared to the VH2 variant, 
indicating that the isoleucine in CDR3 could not be substituted with methionine. The 

15 next goal was to test whether insertion of an amino acid in CDR3 could yield a KS-1/4 
heavy chain V region with better binding characteristics than VH1. The data in Table 6 
compare the binding of VK1/VH2.1, VK1/VH2.2, VK1/VH2.3, and VK1 /VH2.4, with 
VK1/VH1 . It was found that none of the constructs with an amino acid insertion in the 
KS-1/4 V H CDR3 showed improved antigen binding compared to VH1, rather, antigen 

20 binding activity of the insertion mutants was either somewhat decreased or profoundly 
decreased. 

These results indicate that insertion of amino acids in CDR3 generally is 
deleterious to the antigen binding activity of KS-1/4 heavy chain V regions. When this 
data is analyzed, some general conclusions emerge. Specifically, the segment of KS-1/4 
25 V H amino acid at positions 84 to 108, consisting of the amino acids Asn-Asn-Leu-Arg- 
Asn-Glu-Asp-Met-Ala-Thr-Ty^^ 

Gly-GIn, is important for KS-1/4 antigen binding. This segment includes a framework 
segment, Asn-Asn-Leu-Arg-Asn-Glu-Asp-Met-Ala-Thr-Tyr-Phe-Cys-Val-Arg, which is 
generally tolerant to single and multiple amino acid substitutions, but not tolerant to 
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amino acid insertions, which may have a deletenous effect on expression and assembly. 
In addition, the data suggests that for the amino acids at positions 86, 91 , 93, 94, and 95, 
it is preferable to have hydrophobic amino acids for an antibody that is efficiently 
expressed and binds to EpCAM. 

Insertion of an amino acid in the V H CDR3 segment, consisting of Phe-Ile-Ser- 
Lys-Gly-Asp-Tyr, is generally deleterious to the EpCAM antigen-binding function of a 
KS-1/4 antibody, although some insertions can be tolerated with only partial loss of 
activity. Similarly, substitution of these positions is also generally deleterious to binding 
of the EpCAM antigen, although some insertions can be tolerated with only partial loss of 
activity. 

4C. Construction of active derivatives of KS-1/4 antibodies with mouse surface 
residues converted to their human counterparts 

Antibodies were prepared by substituting amino acids within the KS-1/4 antibody 
with amino acids commonly found in human antibodies in order to minimize the 
immunogenicity of the mouse-derived V regions. Preferred KS derivatives also retained 
specific binding affinity for human EpCAM. 

Construct L It was found that the KS-I/4 light chain most closely resembled 
human consensus subgroup III, and the heavy chain most closely resembled subgroup I. 
Based on these similarities, a conceptual sequence consisting of the human consensus 
subgroup amino acids and KS-l/4-derived CDRs and non-consensus amino acids was 
generated. For this and the following constructs a three-dimensional model was 
generated using a Silicon Graphics Workstation and BioSym molecular modeling 
software. 

Inspection of the three-dimensional model revealed that certain human-derived 
25 amino acids were close to the CDRs and were likely to influence their conformation. 

Based on this analysis, in the light chain, human Ser22, Arg44, and Phe66 were changed 
back to Thr, Lys, and Tyr, respectively. In the heavy chain, it was believed such changes 
were unnecessary.. In the final design for Construct 1, the light chain had 18 human 
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amino acids not found in the mouse light chain, and the heavy chain had 22 human amino 
acids not found in the mouse heavy chain. 

DNAs for expression of Construct 1 were created using synthetic 
oligonucleotides. The Construct 1 protein was efficiently expressed but was found to be 
5 more than 10-fold less active in an EpCAM binding assay. 

Construct 2. A less aggressive approach was then taken, by which only the 
following changes were introduced: 

Light chain: K18R,A79P 

Heavy chain: P9A, LI IV, A76T, N88S, M91T 

1 0 DNAs for expression of Construct 2 were created using synthetic oligonucleotides 

and standard recombinant DNA techniques. The Construct 2 protein was not efficiently 
expressed. It was fiirther found that the combination of Construct 2 light chain and 
mouse KS-1/4 heavy chain was not efficiently expressed, while the combination of 
Construct 2 heavy chain and mouse KS-1/4 light chain was efficiently expressed. Thus, 

15 the expression defect appeared to lie in the Construct 2 light chain. 

Construct 3. Based on the apparent expression defect in the Construct 2 light 
chain, a new light chain was constructed by fusing the N-terminal portion of the light 
chain of Construct 1 with the C-terminal portion of the mouse light chain. The Kpnl site, 
which encodes the amino acids at positions 35 and 36, was used. When this light chain 
20 was combined with the Construct 2 heavy chain, efficient expression and no significant 
loss of binding was observed. 

Because Construct 3 resulted in an antibody with superior properties in terms of 
protein expression and affinity for the antigen when compared to Construct 1 or 2, DNA 
sequences of Construct 3 were inserted into pdHL7s-IL2, resulting in pdHL7s- 
25 VK8/VH7-IL2, which is disclosed as SEQ ID NO: 40. For expression purposes, this 
plasmid DNA was electroporated into mouse myeloma cells NS/0 to produce a stably 
transfected cell line as described in Example 1 A. Culture medium taken from stable 
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clones was then assayed for antibody expression in an ELISA coated with human Fc, as 
described in Example IB. The amino acid sequences of the heavy and light chain for this 
antibody fusion protein are shown in SEQ ID NO: 41 and SEQ ID NO: 42, respectively. 

In addition, the binding of iodinated VK8/VH7 and VK8/VH7-1L2 to EpCAM 
5 expressed on the surface of PC-3 tumor cells was compared to binding o f iodinated 
VK0/VH0-1L2, using methods described in Example IF. Within experimental error, 
essentially identical binding affinities were found for VK8/VH7 and VK0/VH0 ? and for 
VK8/VH7-IL2 and VKO/VH0-1L2. 

10 4D. Structure-function relationships useful in constructing active KS-1/4 

antibodies 

Taken together, the antigen binding activities of KS-1/4 antibodies and fusion 
proteins with the disclosed V region sequences provide guidance in designing sequences 
of KS-1/4 antibodies to EpCAM, as well as for proper expression and secretion of KS-1/4 

1 5 antibodies. In particular, the KS-1/4 heavy and light chain V regions can tolerate 
multiple amino acid substitutions and retain activity, provided that these amino acid 
substitutions are outside the CDRs. The KS-1/4 heavy and light chain V regions do not 
generally appear to tolerate amino acid insertions, especially within CDRs or in 
framework regions between CDRs. 

20 For example, if the hybridoma KS-1/4 sequence is taken to be a starting, "wild- 

type" sequence, the data indicate that the heavy chain V region can tolerate amino acid 
substitutions at positions 9, 1 1, 16, 17, 38, 40, 69, 70, 71, 72, 76, 79, 80, 83, 88, 91, and 
1 1 1 with little or no loss of activity. Similarly, the light chain can tolerate amino acid 
substitutions at positions 1, 3, 10, 11, 12, 13, 17, 18, 19, 21, 41, 42, 59, 71, 73, 75, 77, 

25 and 103 with little or no loss of activity. These changes are outside the CDRs of KS-1/4 
heavy and light chain V regions. The 17 clearly acceptable heavy chain amino acid 
substitutions represent about 21% of the amino acid positions outside the CDRs, and 
about 68% of the amino acid positions outside the CDRs for which an amino acid 
substitution was attempted. Similarly, the eighteen clearly acceptable light chain amino 

30 acid substitutions represent about 23% of the amino acid positions outside the CDRs, and 
about 72% of the amino acid positions outside the CDRs for which an amino acid 
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substitution was attempted. There were only two examples of an amino acid substitution 
outside of a CDR that resulted in a significantly less useful protein: the substitution 
Ala79Pro in the light chain, which appeared to have a negative impact on expression; and 
the substitution Q108T in the heavy chain, which had a negative impact on antigen 
5 binding. Thus, an amino acid substitution can be introduced into a KS-1/4 antibody 

heavy chain or light chain sequence outside of a CDR, and there is a high probability that 
the substitution will result in an active protein. 

Mutations involving the substitution of an amino acid in a CDR often have a 
negative impact on antigen binding. For example, the substitution I100M in the heavy 

1 0 chain reduces binding by about 8-fold. Mutations that involve the insertion of an amino 
acid generally have a negative impact on the utility of a KS- 1/4 sequence. For example, 
the VH-'369 heavy chain V region is unable to assemble into a proper antibody with a 
light chain, as described herein. The VH2.1 to 2.4 mutations have an insertion of an 
amino acid in CDR3 of the heavy chain V region, and each of these mutations has a 

1 5 negative impact on antigen binding. 

Example 5. Immunogenicitv of a KS Antibody (Construct 3HL2 Fusion Protein in 
Humans 

In a human clinical trial, twenty two patients received one or more treatment 
regimes, with each treatment regime comprising three consecutive daily 4-hour 

20 intraveous infusions of KS antibody (Construct 3)-IL2. Each treatment regime was 
separated by a month (Weber et al (2001). Proc. Am. Soc. Clin. Oncology 20:259a.). 
Serum samples were harvested from each patient before and after each treatment regime 
and tested for antibody reactivity against the whole KS Antibody (Construct 3)-IL2 
molecule or the Fc-EL2 component (without the Fv region). No reactivity was observed 

25 in any of the pre-immune sera. The results indicated that only 4 patients experienced any 
significant immune response against either the Fv regions alone, or both the Fv regions 
and the Fc-IL2 component. Furthermore, these responses did not appear to be boosted 
upon subsequent exposure to huKS-IL2. 
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It is believed that the use of the antibody-IL2 fusion protein constitutes a 
particularly stringent test of the immunogenicity of the V region, because the interleukin- 
2 moiety has an adjuvant effect. Accordingly, the results indicate that the KS Antibody 
(Construct 3) may be administered to humans with only a small number of recipients 
5 apparently developing an antibody response to the KS antibody (Construct 3)-IL2 fusion 
protein. These results are particularly encouraging in view of the fact that the KS 
antibody (Construct 3) contains a variable region that is almost entirely murine in origin 
but with a few amino acid residues replaced with the corresponding human amino acid 
residues. 

10 EQUIVALENTS 

The invention may be embodied in other specific forms without departing from 
the spirit or essential characteristics thereof. The foregoing embodiments are therefore to 
be considered in all respects illustrative rather than limiting on the invention described 
herein. The scope of the invention is thus indicated by the appended claims rather than 
15 by the foregoing description, and all changes which come within the meaning and range 
of equivalency of the claims are intended to be embraced therein. 

INCORPORATION BY REFERENCE 

The disclosure of each of the patent documents and scientific publications disclosed 
herein, are incorporated by reference into this application in their entirety. 
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What is claimed is: 

1 . A recombinant anti-EpCAM antibody, wherein the antibody comprises an amino acid 
5 sequence defining an immunoglobulin light chain framework region selected from the 

group consisting of: 

(i) amino acid residues 1-23 of SEQ ED NO: 5, wherein Xaal is Q or E, Xaa3 is L 
or V, XaalO is I or T, Xaal 1 is M or L, Xaal 3 is A or L, Xaal 8 is K or R, or Xaa2 1 is M 
or L, provided that at least one of the amino acid residues at positions Xaal, Xaa3, 

10 XaalO, Xaal 1, Xaal3, Xaal8, or Xaa21 is not the same as the amino acid at the 
corresponding position in SEQ ID NO: 1 ; 

(ii) amino acid residues 34-48 of SEQ ID NO: 5, wherein Xaa41 is S or Q, Xaa42 
is S or A, Xaa45 is P or L, or Xaa46 is W or L, provided that at least one of the amino 
acid residues at positions Xaa41, Xaa42, Xaa45, or Xaa46 is not the same as the amino 

1 5 acid at the corresponding position in SEQ ID NO: 1 ; and 

(iii) amino acid residues 56-87 of SEQ ID NO: 5, wherein Xaa57 is F or I, Xaa69 
is S or D, Xaa71 is S or T, Xaa73 is I or T, Xaa77 is M or L, Xaa79 is A or P, Xaa82 is A 
or F, or Xaa84 is T or V, provided that at least one of the amino acid residues at positions 
Xaa57, Xaa69, Xaa71 , Xaa73, Xaa77, Xaa79, Xaa82, or Xaa84 is not the same as the 

20 amino acid at the corresponding position in SEQ ID NO: 1 . 

2. A recombinant anti-EpCAM antibody, wherein the antibody comprises an amino acid 
sequence defining an immunoglobulin heavy chain framework region selected from the 
group consisting of: 

25 (i) amino acid residues 1 -25 of SEQ ID NO: 6, wherein Xaa2 is I or V, Xaa9 is P 

or A, Xaal 1 is L or V, or Xaal 7 is T or S, provided that at least one of the amino acid 
residues at positions Xaa2, Xaa9, Xaal 1 or Xaal 7 is not the same as the amino acid at 
the corresponding position in SEQ ID NO: 2; 

(ii) amino acid residues 36-49 of SEQ ID NO: 6, wherein Xaa38 is K or R, Xaa40 

30 is T or A, or Xaa46 is K or E, provided that at least one of the amino acid residues at 
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positions Xaa38, Xaa40, Xaa46 is not the same as the amino acid at the corresponding 
position in SEQ ID NO: 2; 

(iii) amino acid residues 67-98 of SEQ ID NO: 6, wherein Xaa68 is F or V, Xaa69 
is A or T, Xaa70 is F or I, Xaa73 is E or D, Xaa76 is A or T, Xaa80 is F or Y, Xaa83 is I 

5 or L, Xaa84 is N or S, Xaa85 is N or S, Xaa88 is N, A or S, Xaa91 is M or T, or Xaa93 is 
T or V, provided that at least one of the amino acid residues at positions Xaa68, Xaa69, 
Xaa70, Xaa73, Xaa76, Xaa80, Xaa83, Xaa84, Xaa85, Xaa88, Xaa91 or Xaa93 is not the 
same as the amino acid at the corresponding position in SEQ ID NO: 2; and 

(iv) amino acid residues 106-1 16 of SEQ ID NO: 6, wherein Xaal08 is Q or T. 



10 



15 



3. The recombinant antibody of claim 1, wherein said light chain framework region is 
selected from the group consisting of: 

(i) amino acid residues 1-23 of SEQ ID NO: 8; and 

(ii) amino acid residues 1-23 of SEQ ID NO: 9. 



4. The recombinant antibody of claim 3, wherein said light chain comprises amino acids 
1-106 of SEQ ID NO: 9. 



5. The recombinant antibody of claim 2, wherein said heavy chain framework region is 
20 selected from the group consisting of: 

(i) amino acid residues 1-25 of SEQ ID NO: 18; and 

(ii) amino acid residues 67-98 of SEQ ID NO: 18. 

6. The recombinant antibody of claim 5, wherein said heavy chain comprises amino 
25 acids 1-1 16 of SEQ ID NO: 18. 



7. A recombinant anti-EpCAM antibody comprising light chain amino acid residues 1- 
106 of SEQ ED NO: 9 and heavy chain amino acid residues of SEQ ED NO: 18. 

30 8. The recombinant antibody of claim 1 or 2 wherein said antibody has a Kd for EpCAM 
of at least 10' 8 M. 
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9. The recombinant antibody of claim 1 or 2 wherein said antibody comprises a cytokine. 

10. The recombinant antibody of claim 9 wherein said cytokine is IL-2. 

5 

11. The recombinant antibody of claim 1 wherein said antibody comprises an amino acid 
sequence selected from the group consisting of: 

(i) amino acid residues 24-3 1 of SEQ ID NO: 1 ; 

(ii) amino acid residues 49-55 of SEQ ID NO: 1 ; and 
1 0 (iii) amino acid residues 88-96 of SEQ ID NO: 1 . 

12. The recombinant antibody of claim 2 wherein said antibody comprises an amino acid 
sequence selected from the group consisting of: 

(i) amino acid residues 26-35 of SEQ ID NO: 2; 
1 5 (ii) amino acid residues 50-62 of SEQ ID NO: 2; and 

(iii) amino acid residues 101-105 of SEQ ID NO: 2. 

1 3. An expression vector encoding an antibody of claim 1 or 2. 
20 14. An expression vector encoding the antibody of claim 7. 

15. An expression vector having a nucleotide sequence set forth in SEQ ED NO: 32. 

16. A method of treating a human patient having a disease associated with EpCAM over- 
25 expression, said method comprising the step of administering an antibody of claim 1 or 2 

to a patient. 

17. The method of claim 16, wherein said antibody further comprises a cytokine. 

30 18. The method of claim 17, wherein said antibody is administered as an antibody- 
cytokine fusion protein. 
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SEQUENCE LISTING 

<110> Gillies, Stephen 
Lo, Kin-Ming 
Qian, Xiugi 

Lexigen Pharmaceuticals Corp. 
<120> Recombinant Tumor Specific Antibody And Use Thereof 
<130> LEX- 019PC 

<150> US 60/288, 564 
<151> 2001-05-03 

<160> 42 

<170> Patentln version 3.0 

<210> 1 
<211> 106 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS VK mouse 
<400> 1 

Gin lie Leu Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 
15 10 15 

Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp lie Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu lie lie Ser Ser Met Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 

100 105 

<210> 2 
<211> 106 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS VH mouse 
<400> 2 



Gin lie Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 
15 10 15 
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Thr Val Lys lie 

20 

Gly Met Asn Trp 
35 

Gly Trp lie Asn 
50 

Lys Gly Arg Phe 
65 

Leu Gin Leu Asn 



Val Arg Phe lie 

100 

Thr Val Ser Ser 
115 



Ser Cys Lys Ala 



Val Arg Gin Ala 

40 

Thr Tyr Thr Gly 
55 

Val Phe Ser Leu 
70 

Asn Leu Arg Ser 
85 

Ser Lys Gly Asp 



Ser Gly Tyr Thr 
25 

Pro Gly Lys Gly 



Glu Pro Thr Tyr 

60 

Glu Thr Ser Ala 
75 

Glu Asp Thr Ala 
90 

Tyr Trp Gly Gin 
105 



Phe Thr Asn Tyr 
30 

Leu Lys Trp Met 
45 

Ala Asp Asp Phe 



Ser Thr Ala Phe 

80 

Thr Tyr Phe Cys 
95 

Gly Thr Ser Val 
110 



<210> 3 

<211> 106 

<212> PRT 

c213> Artificial sequence 
<220> 

<2 23> variable light chain sequence in the EpCAM antibody 
<220> 

<221> misc_f eature 

<222> (1) . . (1) 

<223> wherein xaa at position l is a glutamic acid 



<220> 

<221> misc_feature 

<222> (3) . . (3) 

<223> wherein Xaa at position 3 is a valine 



<220> 

<22i> misc_feature 

<222> (10) . . (10) 

<223> wherein Xaa at position 10 is a threonine or a serine 



<220> 

<221> misc_feature 

<222> (11) . . (11) 

<223> wherein Xaa at position 11 is a leucine 



c220> 

<221> misc_feature 

<222> (12) . . (12) 

<223> wherein Xaa at position 12 is an alanine 
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<220> 

<221> misc_f eature 
<222> (13) . . (13) 

<223> wherein Xaa at position 13 is a leucine or a valine 



<220> 

< 2 2 1 > mi s c_f ea t ur e 

<222> (17) . . (17) 

<223> wherein Xaa at position 17 is a glutamine 



<220> 

< 2 2 1 > mi scf ea tur e 
<222> (18) . . (18) 

<223> wherein Xaa at position 

<220> 

< 2 2 1 > mi sc_f ea ture 
<222> (19) . - (19) 

<223> wherein Xaa at position 

<220> 

<221> misc_f eature 

<222> (21) . . (21) 

<223> wherein Xaa at position 



18 is an arginine 



19 is an alanine 



21 is a leucine or an isoleucine 



<220> 

<221> misc_f eature 

<222> (32) . . (32) 

<223> wherein Xaa at position 32 is an isoleucine 



<220> 

<221> misc_f eature 

<222> (36) . . (36) 

<223> wherein Xaa at position 36 is a leucine 



<220> 

<221> misc feature 

<222> (41) . . (41) 

<22 3> wherein Xaa at position 41 is a glutamine 



<220> 

<221> mi sc_f eature 

<222> (42) . . (42) 

<223> wherein Xaa at position 42 is an alanine or a proline 
<220> 

< 22 1 > misc_f eature 

<222> (45) . . (45) 

<223> wherein Xaa at position 45 is a leucine 



<220> 
<221> 



misc feature 
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<222> (46) . . (46) 

<223> wherein Xaa at position 46 is a leucine 



<220> 

<221> misc_f eature 

<222> (48) . . (48) 

<223> wherein Xaa at position 48 is a tyrosine 



<220> 

<221> miscfeature 

<222> (57) . . (57) 

<223> wherein Xaa at position 57 is an isoleucine 



<220> 

<22l> misc_feature 

c222> (59) . . (59) 

<223> wherein Xaa at position 59 is a serine 



<220> 

<221> misc_feature 

<222> (69).. (69) 

<223> wherein Xaa at position 69 is a aspartic acid or a threonine 



<220> 

<22l> misc_feature 

<222> (71) . . (71) 

<223> wherein Xaa at position 71 is a threonine 



<220> 

<221> mi sc_f eature 

<222> (73) . . (73) 

<223> wherein Xaa at position 73 is a threonine 



<220> 

<221> misc_f eature 

<222=> (75) . . (75) 

<223> wherein Xaa at position 7 5 is an asparagine 



<220> 

<221> misc_feature 

<222> (77).. (77) 

<223> wherein Xaa at position 77 is a leucine 



<220> 

<221> mi sc — feature 

<222> (79) . . (79) 

<223> wherein Xaa at position 7 9 is a proline 



<220> 

<221> misc feature 
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<222> (82) . - (82) 

<223> wherein Xaa at position 82 is a phenylalanine 
<220> 

<221> misc_feature 
<222> (84) . . (84) 

<223> wherein Xaa at position 84 is a valine 



<220> 

<221> misc_feature 

<222> (103) . . (103) 

<223> wherein Xaa at position 103 is a valine 



<400> 3 

Xaa lie Xaa Leu Thr Gin Ser Pro Ala Xaa Xaa Xaa Xaa Ser Pro Gly 
15 10 15 

Xaa Xaa Xaa Thr Xaa Thr Cys Ser Ala Ser Ser Ser Val Ser Thr Xaa 

20 25 30 

Leu Trp Tyr Xaa Gin Lys Pro Gly Xaa Xaa Pro Lys Xaa Xaa lie Xaa 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Xaa Pro Xaa Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Xaa Tyr Xaa Leu Xaa He Xaa Ser Xaa Glu Xaa Glu 
65 70 75 80 

Asp Xaa Ala Xaa Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Xaa Glu He Lys 

100 105 

<210> 4 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> variable heavy chain sequence in the EpCAM antibody 
<220> 

<221> misc_f eature 
<222> (2) . . (2) 

<223> wherein Xaa at position 2 is an isoleucine or a valine 



<220> 

<221> misc_feature 

<222> (9) . . (9) 

<223> wherein Xaa at position 9 is a proline or an alanine 

<:220> 

<221> misc feature 
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<222> (11).. (11) 

<223> wherein Xaa at position 11 is a leucine or a valine 
<220> 

<22l> misc_feature 
<222> (16) . . (16) 

<223> wherein Xaa at position 16 is a glutamic acid or a serine 



<220> 

<221> misc_f eature 

<222> (17) . . (17) 

<223> wherein Xaa at position 17 is a threonine or a serine 



<220> 

<221> misc_f eature 

<222> (38) . . (38) 

<223> wherein Xaa at position 38 is a lysine or an arginine 



<220> 

<221> misc_f eature 

<222> (40) . . (40) 

<223> wherein Xaa at position 40 is a threonine or an alanine 



<220> 

<221> misc_f eature 

<222> (43) . . (43) 

<223> wherein Xaa at position 43 is a lysine or a glutamine 



<220> 

<221> misc_f eature 

<222> (46) . . (46) 

<223> wherein Xaa at position 46 is a lysine or a glutamic acid 



<220> 

<221> msic_feature 

<222> (63) . . (63) 

<223> wherein Xaa at position 63 
<220> 

<221> misc_£eature 

<222> (65) . . (65) 

<223> wherein Xaa at position 65 
<220> 

< 2 2 1 > mis c_f eature 

<222> (68) . . (68) 

<223> wherein Xaa at position 68 



is an aspartic acid or a lysine 



is a lysine or a glutamine 



is a phenylalanine or a valine 



<220> 

<22l> misc_f eature 
<222> (69) . . (69) 
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<223> wherein Xaa at position 69 is an alanine, a threonine or a 
valine 

<220> 

<221> mi sc_f eature 

<222> (70) . . (70) 

<223> wherein Xaa at position 70 is a phenylalanine or an isoleucine 



<220> 

<221> misc_f eature 

<222> (71) . . (71) 

<223> wherein Xaa at position 71 is a serine or a threonine 



<220> 

<221> mi sc_f eature 

<222> (72) . . (72) 

<223> wherein Xaa at position 72 is a leucine or an alanine 
<220> 

<221> misc_f eature 

<222> (73) . . (73) 

<223> wherein Xaa at position 73 is a glutamic acid or an aspartic 
acid 



<220> 

<221> misc_feature 

<222> (76) . . (76) 

<223> wherein Xaa at position 76 is an alanine or a threonine 
<220> 

<22l> misc_feature 

<222> (79) . . (79) 

<223> wherein Xaa at position 79 is an alanine or a leucine 
<220> 

<221> misc_f eature 

<222> (80) . . (80) 

<:223> wherein Xaa at position 80 is a phenylalanine or a tyrosine 
<220> 

<221> misc feature 

<222> (83) . . (83) 

<223> wherein Xaa at position 83 is an isoleucine of a leucine 



<220> 

<221> misc_f eature 

<222> (84) . . (84) 

<223> wherein Xaa at position 84 is an asparagine or a serine 



<220> 
<221> 



misc feature 
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<222> (85) . . (85) 

<223> wherein Xaa at position 85 is an asparagine or a serine 
<220> 

<221> misc_feature 
<222> (88) . . (88) 

<223> wherein Xaa at position 88 is an asparagine, an alanine or a 
serine 



<220> 

<221> misc_feature 

<222> (91) . . (91) 

<223> wherein Xaa at position 91 is a methionine or a threonine 



<220> 

<221> misc_feature 

<222> (93) . . (93) 

<223> wherein Xaa at position 93 is a threonine or a valine 



<220> 

<221> misc_f eature 

<222> (100) . . (100) 

<223> wherein Xaa at position 100 is an isoleucine or a methionine 



<220> 

<221> misc_f eature 

<222> (108) . . (108) 

<223> wherein Xaa at position 108 is a glutamine or a threonine 
<220> 

<221> misc_feature 

<222> (111) . . (Ill) 

<223> wherein Xaa at position 111 is a serine or a threonine 



<400> 4 

Gin Xaa Gin Leu 
1 

Xaa Val Lys lie 

20 

Gly Met Asn Trp 

35 

Gly Trp lie Asn 
50 

Xaa Gly Arg Xaa 
65 

Leu Gin Xaa Xaa 



Val Gin Ser Gly 

5 

Ser Cys Lys Ala 



Val Xaa Gin Xaa 

40 

Thr Tyr Thr Gly 
55 

Xaa Xaa Xaa Xaa 

70 

Xaa Leu Arg Xaa 
85 



Xaa Glu Xaa Lys 
10 

Ser Gly Tyr Thr 
25 

Pro Gly Xaa Gly 



Glu Pro Thr Tyr 

60 

Xaa Thr Ser Xaa 

75 

Glu Asp Xaa Ala 
90 



Lys Pro Gly Xaa 
15 

Phe Thr Asn Tyr 
30 

Leu Xaa Trp Met 
45 

Ala Asp Xaa Phe 



Ser Thr Xaa Xaa 

80 

Xaa Tyr Phe Cys 
95 
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Val Arg Phe Xaa Ser Lys Gly Asp Tyr Trp Gly Xaa Gly Thr Xaa Val 

100 105 110 

Thr Val Ser Ser 

115 

<210> 5 
<211> 106 
<212> PRT 

<213> Artificial sequence 



<220> 

<223> light sequence consensus 
<220> 

<221> misc_feature 

<222> (1) . . (1) 

<223> wherein xaa at position l is a glutamine or a glutamic acid 

<220> 

<221> misc_feature 

<222> (3) . . (3) 

<223> wherein Xaa at position 3 is a leucine or a valine 



<220> 

<221> misc_feature 

<222> (10) . . (10) 

<223> wherein Xaa at position 10 is an isoleucine or a threonine 



<220> 

<22l> misc_feature 

<222> (11) . . (11) 

<223> wherein Xaa at position 11 is a methionine or a leucine 



<220> 

<22l> misc_f eature 

<222> (13).. (13) 

<223> wherein Xaa at position 13 is an alanine or a leucine 



<220> 

<221> misc_£eature 

<222> (18).. (18) 

<223> wherein Xaa at position 18 is a lysine or an arginine 



<220> 

<221> misc_feature 

<222> (21) . . (21) 

<223> wherein Xaa at position 21 is a methionine or a leucine 



<220> 

<221> misc_feature 

<222> (41) . . (41) 

<2 23> wherein Xaa at position 41 is a serine or a glutamine 
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<220> 

<221> mi sc_f eature 

<222> (42) . . (42) 

<223> wherein Xaa at position 42 is a serine or an alanine 



<220> 

<221> mi sc__f eature 

<222> (45) . . (45) 

<223> wherein Xaa at position 45 is a proline or a leucine 



<220> 

<22l> misc_feature 

<222> (46) . . (46) 

<223> wherein Xaa at position 46 is a tryptophan or a leucine 



<220> 

<221> misc_f eature 

<222> (57) . . (57) 

<223> wherein Xaa at position 57 is a phenylalanine or an isoleucine 



<220> 

<221> mi sc__f eature 

<222> (69) . . (69) 

<223> wherein Xaa at position 69 is a serine or an aspartic acid 



<220> 

<221> misc_f eature 

<222> (71) . . (71) 

<223> wherein Xaa at position 71 is a serine or a threonine 



<220> 

<221> mi sc_f eature 

<222> (73) . . (73) 

<223> wherein Xaa at position 73 is an isoleucine or a threonine 



<220> 

<22l> misc__f eature 

<222> (77) . . (77) 

<223> wherein Xaa at position 77 is a methionine or a leucine 



<220> 

<22l> misc_f eature 

<222> (79) . . (79) 

<223> wherein Xaa at position 79 is an alanine or a proline 



<220> 

< 2 2 1 > mis cofeature 

<222> (82) . . (82) 

<223> wherein Xaa at position 82 is an alanine or a phenylalanine 



WO 02/090566 
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<220> 

<22l> misc_f eature 

<222> (84) . . (84) 

<223> wherein Xaa at position 84 is a threonine or a valine 



<400> 5 

Xaa lie Xaa Leu 
1 

Glu Xaa Val Thr 

20 

Leu Trp Tyr Gin 
35 

Asp Thr Ser Asn 
50 

Gly Ser Gly Thr 
65 

Asp Xaa Ala Xaa 



Phe Gly Gly Gly 

100 



Thr Gin Ser Pro 

5 

Xaa Thr Cys Ser 



Gin Lys Pro Gly 

40 

Leu Ala Ser Gly 
55 

Xaa Tyr Xaa Leu 
70 

Tyr Tyr Cys His 
85 

Thr Lys Leu Glu 



Ala Xaa Xaa Ser 
10 

Ala Ser Ser Ser 
25 

Xaa Xaa Pro Lys 



Xaa Pro Ala Arg 

60 

Xaa lie Ser Ser 
75 

Gin Arg Ser Gly 
90 

lie Lys 
105 



Xaa Ser Pro Gly 
15 

Val Ser Thr Met 
30 

Xaa Xaa lie Phe 

45 

Phe Ser Gly Ser 



Xaa Glu Xaa Glu 

80 

Tyr Pro Tyr Thr 
95 



<210> 6 

<211> 116 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> heavy sequence consensus 
<220> 

<221> misc_f eature 

<222> (2) . . (2) 

<223> wherein Xaa at position 2 is an isoleucine or a valine 



<220> 

<22l> misc_f eature 

<222> (9).. (9) 

<223> wherein Xaa at position 9 is a proline or an alanine 



<220> 

<221> misc_f eature 

<222> (11) . . (11) 

<223> wherein Xaa at position 11 is a leucine or a valine 



<220> 

<221> mis cofeature 

<222> (17) . . (17) 

<223> wherein Xaa at position 17 is a threonine or a serine 
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<220> 

<221> misc_feature 

<222> (38) . . (3B) 

<223> wherein Xaa at position 38 is a lysine or an arginine 



<220> 

<221> misc_f eature 

<222> (40) . . (40) 

<223> wherein Xaa at position 40 is a threonine or an alanine 



<220> 

<221> mi sc_f eature 

<222> (46) . . (46) 

<223> wherein Xaa at position 46 is a lysine or a glutamic acid 



<220> 

< 2 2 1 > mis c_f ea tur e 

<222> (63) . . (63) 

<223> wherein Xaa at position 63 is an aspartic acid or a lysine 



<220> 

<221> misc_feature 

<222> (65) . . (65) 

<223> wherein Xaa at position 65 is a lysine or a glutamine 



<220> 

<221> misc_feature 

<222> (68) . . (68) 

<223> wherein Xaa at position 68 is a phenylalanine or a valine 



<220> 

<221> misc_feature 

<222> (69) . . (69) 

<223> wherein Xaa at position 69 is an alanine or a threonine 
<220> 

<221> mi sc__f eature 

<222> (70).. (70) 

<223> wherein Xaa at position 70 is a phenylalanine or an isoleuci 



<220> 

<221> misc_f eature 

<222> (73).. (73) 

<223> wherein Xaa at position 73 is a glutamic acid or an aspartic 
acid 



<220> 

<221> misc_feature 

<222> (76) . . (76) 

<223> wherein Xaa at position 76 is an alanine or a threonine 



♦ 
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<220> 

<221> misc_feature 

<222> (80) . . (80) 

<223> wherein Xaa at position 80 is a phenylalanine or a tyrosine 
<220> 

<221> misc_f eature 

<222> (83) . . (83) 

<223> wherein Xaa at position 83 is an isoleucine or a leucine 
<220> 

< 2 21 > misc_jf eature 

<222> (84) . . (84) 

<223> wherein Xaa at position 84 is an asparagine or a serine 
<220> 

<221> misc_f eature 

<222> (85) . . (85) 

<223> wherein Xaa at position 85 is an asparagine or a serine 
<220> 

<221> mi sc_f eature 

<222> (88) . . (88) 

<223> wherein Xaa at position 88 is an asparagine, an alanine or a 
serine 



<220> 

<221> misc_f eature 

<222> (91) . . (91) 

<223> wherein Xaa at position 91 is a methionine or a threonine 
<220> 

<22l> misc_f eature 

<222> (93).. (93) 

<223> wherein Xaa at position 93 is a threonine or a valine 
<220> 

<22i> misc_f eature 

<222> (108) . . (108) 

<223> wherein Xaa at position 108 is a glutamine or a threonine 

<400> 6 

Gin Xaa Gin Leu Val Gin Ser Gly Xaa Glu Xaa Lys Lys Pro Gly Glu 
1 5 io 15 

Xaa Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Xaa Gin Xaa Pro Gly Lys Gly Leu Xaa Trp Met 
35 40 ' 45 
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Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Xaa Phe 
50 55 60 



Xaa Gly Arg Xaa Xaa Xaa Ser Leu Xaa Thr Ser Xaa Ser Thr Ala Xaa 
65 70 75 80 

Leu Gin Xaa Xaa Xaa Leu Arg Xaa Glu Asp Xaa Ala Xaa Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Xaa Gly Thr Ser Val 

100 105 no 

Thr Val Ser Ser 
115 

<210> 7 
<211> 106 
<2X2> PRT 

<213> Artificial sequence 



<220> 

<223> Vk6 light chain 
<400> 7 

Glu He Val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
1 5 10 15 

Glu Arg Val Thr Leu Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Lys Leu Leu He Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly He Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Leu Thr He Ser Ser Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 

100 105 



<210> 


8 


<211> 


106 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


VK7 light chain 


<400> 


8 



Gin He Leu Leu Thr Gin Ser Pro Ala He Met Ser Ala Ser Pro Gly 
15 10 15 
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Glu Arg Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp lie Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu lie lie Ser Ser Met Glu Pro Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 

100 105 

<210> 9 
<211> 106 
<:212> PRT 

<213> Artificial sequence 
<220> 

<223> VK8 light chain 
<400> 9 

Glu lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Val Thr Leu Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp lie Phe 

35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu lie lie Ser Ser Met Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 

100 105 



<210> 


10 


<211> 


106 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


KS VK veneered 


<400> 


10 



Gin He Leu Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Pro Gly 
1 5 10 15 
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Gin Arg Ala Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Pro Trp He Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Thr Leu Thr He Asn Ser Leu Glu Ala Glu 

65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Val Glu He Lys 

100 105 

<210> 11 
<211> 106 
<212> PRT 

<213> Artificial sequence 



<220> 

<223> KS de- immunized VKl 
<400> 11 

Gin He Val Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Pro Gly 
15 10 15 

Gin Arg Ala Thr He Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr lie 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Pro Trp He Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Thr Leu Thr He Asn Ser Leu Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Val Glu He Lys 

105 





100 


<210> 


12 


<211> 


106 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


KS de- immunized VK2 



<400> 12 
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Gin lie Val Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Pro Gly 
15 10 15 

Gin Arg Ala Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Pro Trp He Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Thr Leu Thr lie Asn Ser Leu Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Val Glu lie Lys 

100 105 



<210> 13 
<211> 106 
<212> PRT 

<213> Artificial sequence 




<220> 

<223> KS-deimmunized VK3 




<400> 13 












Gin He Leu 
l 


■ 

Leu 


Thr 
5 


Gin 


Ser 


Pro Ala Ser Leu Ala Val Ser Pro Gly 
10 15 


Gin Arg Ala 


Thr 
20 


lie 


Thr 


Cys 


Ser Ala Ser Ser Ser Val Ser Tyr Met 
25 30 


Leu Trp Tyr 
35 


Gin 


Gin 


Lys 


Pro Gly Gin Pro Pro Lys Pro Trp He Phe 
40 45 


Asp Thr Ser 
50 


Asn 


Leu 


Ala 


Ser 
55 


Gly Phe Pro Ser Arg Phe Ser Gly Ser 

60 


Gly Ser Gly 
65 


Thr 


Ser 


Tyr 
70 


Thr 


Leu Thr He Asn Ser Leu Glu Ala Glu 

75 80 


Asp Ala Ala 


Thr 


Tyr 
85 


Tyr 


Cys 


His Gin Arg Ser Gly Tyr Pro Tyr Thr 
90 95 


Phe Gly Gly 


Gly 
100 


Thr 


Lys 


Val 


Glu He Lys 
105 



<210> 14 

<211> 106 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS de- immunized VK4 



<400> 



14 
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Gin lie Leu Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Pro Gly 
15 10 15 

Gin Arg Ala Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Pro Trp lie Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Thr Leu Thr He Asn Ser Leu Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Val Glu He Lys 

100 105 

<210> 15 
<211> 106 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS de- immunized VK5 
<400> 15 

Gin He Leu Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Pro Gly 
15 10 15 

Gin Arg Ala Thr He Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp He Tyr 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 

50 55 60 

Gly Ser Gly Thr Ser Tyr Thr Leu Thr He Asn Ser Leu Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Val Glu He Lys 

100 105 



<210> 


16 


<211> 


106 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


KS VK mouse 



<400> 



16 



WO 02/090566 PCT/US02/13844 
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Gin lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser Ala Ser Pro Gly 
15 10 15 

Glu Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Leu Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp lie Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Thr Tyr Ser Leu lie He Ser Ser Leu Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu He Lys 

100 105 

<210> 17 
<211> 116 
<212> PRT 

<213> Artificial sequence 



<220> 

<223> VH6 heavy chain 

* 

<400> 17 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Glu 
1 5 10 is 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Met 
35 40 45 

Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Gin Lys Phe 
50 55 60 

Gin Gly Arg Val Thr He Ser Leu Asp Thr Ser Thr Ser Thr Ala Tyr 
65 70 75 80 

Leu Gin Leu Ser Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Ser Val 

100 105 no 

Thr Val Ser Ser 
115 

<210> 18 

<211> 116 

<212> PRT 

<213> Artificial sequence 



WO 02/090566 
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<220> 

<223> VH7 heavy chain 
<400> 18 

Gin lie Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Glu 
15 10 15 

Thr Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Lys Gin Thr Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Thr Ser Thr Ala Phe 
65 70 75 80 

Leu Gin lie Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe lie Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Ser Val 

100 105 110 

Thr Val Ser Ser 
115 

<210> 19 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> VH2.5 heavy chain 
<400> 19 

Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Thr He Thr Ala Glu Thr Ser Thr Ser Thr Leu Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Thr Gly Thr Thr Val 

100 105 110 



Thr Val Ser Ser 
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115 



v. Z. J. \J > 




<211> 


116 


< z ±z > 


OUT 


<213> 


Artificial sequence 


<220> 




<223> 


KS VH veneered 


<400> 


20 


Gin lie Gin Leu Val Gin Ser 


1 


5 



10 15 

Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Lys Gin Ala Pro Gly Gin Gly Leu Lys Trp Met 
35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Thr Phe Thr lie Glu Thr Ser Thr Ser Thr Ala Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Met Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe lie Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 110 

Thr Val Ser Ser 
115 

<210> 21 
<211> 116 
<212> PRT 

c213> Artificial sequence 
<220> 

<223> KS de-immunized VH1 
<400> 21 

Gin lie Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 



Lys Gly Arg Phe Thr 
65 



He Thr Ala Glu Thr Ser Thr Ser Thr Leu Tyr 
70 75 80 
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Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe Met Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 110 

Thr Val Ser Ser 
115 

<210> 22 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS de-immunized VH2 
<400> 22 

Gin lie Gin Leu val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 

35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Thr He Thr Ala Glu Thr Ser Thr Ser Thr Leu Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 no 

Thr Val Ser Ser 
115 

<210> 23 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS de- immunized VH3 
<400> 23 

Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 
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Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Thr He Thr Leu Glu Thr Ser Thr Ser Thr Ala Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 110 

Thr Val Ser Ser 
115 

<210> 24 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS- deimmunized VH4 
<400> 24 

Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

■ 

Gly Met Asn Trp Val Lys Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Thr He Thr Leu Glu Thr Ser Thr Ser Thr Ala Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 no 

Thr Val Ser Ser 
115 



<210> 


25 


<211> 


116 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


KS de- immunized VH5 


<400> 


25 



Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Ser 
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15 10 15 

Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Lys Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 

35 40 45 

Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Ala Phe Thr Leu Glu Thr Ser Thr Ser Thr Ala Tyr 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Thr Val 

100 105 110 

Thr Val Ser Ser 
115 

<210> 26 
<211> 116 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> KS VH mouse 
<400> 26 

Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 
15 10 15 

Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp He Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Val Phe Ser Leu Glu Thr Ser Ala Ser Thr Ala Phe 
65 70 75 80 

Leu Gin Leu Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe He Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Ser Val 

100 105 no 

Thr Val Ser Ser 
115 

<210> 27 

<211> 25 

<212> DNA 

<213> Artificial sequence 
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<220> 
<223> 



KSA sense primer 



<400> 27 

tctagagcag catggcgccc ccgca 



25 



<210> 28 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> KSA antisense primer 

<400> 28 

ctcgagttat gcattgagtt ccct 24 

<210> 29 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> linker-adapter 

<400> 29 

aattctcaat gcagggc 17 

<210> 30 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> linker- adapter 

<400> 30 

gagttacgtc ccgaatt 17 

<210> 31 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> VH forward primer 

<400> 31 

gactcgagcc caagtcttag acatc 25 



<210> 32 
<211> 33 
<212> DNA 
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<220> 

<223> VH reverse primer 
<400> 32 

caagcttacc tgaggagacg gtgactgacg ttc 33 



<2io> 33 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> VK forward primer 

<400> 33 

gatctagaca agatggattt tcaagtg 27 



<210> 34 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> VK reverse primer 

<400> 34 

gaagatctta cgttttattt ccagcttgg 29 



<210> 35 
<211> 117 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> VH369 heavy chain 
<400> 35 

Gin lie Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 
15 10 15 

Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Lys Gin Thr Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Ser Thr Ala Phe 
65 70 75 80 

Leu Gin He Gin Gin Pro Gin Asn Met Arg Thr Met Ala Thr Tyr Phe 

85 90 95 
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Cys Val Arg Phe lie Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Ser 

100 105 110 

Val Thr Val Ser Ser 

115 
<210> 36 
<211> 19 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> VH2 . 1 partial sequence 
<400> 36 

Ala Thr Tyr Phe Cys Val Arg Phe lie lie Ser Lys Gly Asp Tyr Trp 
15 10 15 

Gly Gin Gly 



<210> 37 
<211> 19 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> VH2.2 partial sequence 
<400> 37 

Ala Thr Tyr Phe Cys Val Arg Phe lie Val Ser Lys Gly Asp Tyr Trp 
i5 io 15 

Gly Gin Gly 



<210> 38 

<211> 19 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> VH2.3 partial sequence 

<400> 38 

Ala Thr Tyr Phe Cys Val Arg Phe 
1 5 

Gly Gin Gly 



He Ser Ala Lys Gly Asp Tyr Trp 
10 15 



<210> 39 

<211> 19 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> VH2.4 partial sequence 
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<400> 39 

Ala Thr Tyr Phe Cys Val Arg Phe lie Ser Lyg Thr Gly Asp Tyr Trp 
15 10 15 

Gly Gin Gly 

<210> 40 
<211> 10494 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> pdHL7s-VK8/VH7-IL2 sequence 
<400> 40 

gtcgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60 

gtcgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 120 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 180 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 240 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 300 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 360 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 420 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 480 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 540 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 600 

aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctctc tggctaacta 660 

cagaacccac tgcttactgg cttatcgaaa ttaatacgac tcactatagg gagaccctct 720 

agaatgaagt tgcctgttag gctgttggtg ctgatgttct ggattcctgg tgaggagaga 780 

gggaagtgag ggaggagaat ggacagggag caggagcact gaatcccatt gctcattcca 84 0 

tgtatctggc atgggtgaga agatgggtct tatcctccag catggggcct ctggggtgaa 900 

tacttgttag agggaggttc cagatgggaa catgtgctat aatgaagatt atgaaatgga 960 
tgcctgggat ggtctaagta atgccttaga agtgactaga cacttgcaat tcactttttt 1020 
tggtaagaag agatttttag gctataaaaa aatgttatgt aaaaataaac gatcacagtt 1080 
gaaataaaaa aaaaatataa ggatgttcat gaattttgtg tataactatg tatttctctc 1140 

» 

tcattgtttc agcttcctta agcgagatcg tgctgaccca gtcccccgcc accctgtccc 1200 

tgtcccccgg cgagcgcgtg accctgacct gctccgcctc ctcctccgtg tcctacatgc 1260 

tgtggtacca gcagaagcca ggatcctcgc ccaaaccctg gatttttgac acatccaacc 1320 
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tggcttctgg 


attccctgct 


cgcttcagtg 


gcagtgggtc 


tgggacctct 


tactctctca 


1380 


taatcagcag 


catggaggct 


gaagatgctg 


ccacttatta 


ctgccatcag 


cggagtggtt 


1440 


acccgtacac 


gttcggaggg 


gggaccaagc 


tggaaataaa 


aegtaagate 


ccgcaattct 


1500 


aaactctgag 


ggggtcggat 


gacgtggcca 


ttctttgcct 


aaagcattga 


gtttactgea 


1560 


aggccagaaa 


agcatgcaaa 


gccctcagaa 


tggctgcaaa 


gagctccaac 


aaaacaattt 


1620 


agaactttat 


taaggaatag ggggaagcta ggaagaaact 


caaaacatca 


agattttaaa 


1680 


tacgcttctt 


ggtctccttg 


ctataattat 


ctgggataag 


catgctgttt 


tctgtctgtc 


1740 


cctaacatgc 


cctgtgatta 


tccgcaaaca 


acacacccaa 


gggcagaact 


ttgttactta 


1800 


aacaccatcc 


tgtttgcttc 


tttcctcagg 


aactgtggct 


gcaccatctg 


tcttcatctt 


1860 


cccgccatct 


gatgagcagt 


tgaaatctgg 


aactgcctct 


gttgtgtgcc 


tgctgaataa 


1920 


cttctatccc 


agagaggcca 


aagtacagtg 


gaaggtggat 


aacgccctcc 


aatcgggtaa 


1980 


ctcccaggag 


agtgtcacag agcaggacag 


caaggacagc 


acctacagcc 


tcagcagcac 


2040 


cctgacgctg 


agcaaagcag actacgagaa 


acacaaagtc 


tacgcctgcg 


aagtcaccca 


2100 


tcagggcctg 


agctcgcccg 


tcacaaagag 


cttcaacagg 


ggagagtgtt 


agagggagaa 


2160 


gtgcccccac 


ctgctcctca gttccagcct 


gaccccctcc 


catcctttgg 


cctctgaccc 


2220 


tttttccaca 


ggggacctac 


ccctattgcg 


gtcctccagc 


tcatctttca 


cctcaccccc 


2280 


ctcctcctcc 


ttggctttaa 


ttatgctaat 


gttggaggag 


aatgaataaa 


taaagtgaat 


2340 


ctttgcacct 


gtggtttctc 


tctttcctca 


atttaataat 


tattatctgt 


tgtttaccaa 


2400 


ctactcaatt 


tctcttataa 


gggactaaat 


a tg tag teat 


cctaaggcgc 


ataaccattt 


2460 


ataaaaatca 


tccttcattc 


tattttaccc 


tatcatcctc 


tgcaagacag 


tcctccctca 


2520 


aacccacaag 


ccttctgtcc 


tcacagtccc 


ctgggccatg gtaggagaga 


cttgcttcct 


2580 


tgttttcccc 


tcctcagcaa 


gccctcatag 


tcctttttaa 


gggtgacagg 


tettaeggtc 


2640 


atatatcctt 


tgattcaatt 


ccctgggaat 


caaccaaggc 


aaatttttca 


aaagaagaaa 


2700 


cctgctataa 


agagaatcat 


tcattgcaac 


atgatataaa 


ataacaacac 


aataaaagca 


2760 


attaaataaa 


caaacaatag ggaaatgttt 


aagttcatca 


tggtacttag 


acttaatgga 


2820 


atgtcatgcc 


ttatttacat 


ttttaaacag 


gtactgaggg 


actcctgtct 


gccaagggcc 


2880 


gtattgagta 


ctttccacaa 


cctaatttaa 


tccacactat 


actgtgagat 


taaaaacatt 


2940 

mm m^ • 


cattaaaatg 


ttgcaaaggt 


tctataaagc 


tgagagacaa 


atatattcta 


taactcagca 


3000 


atcccacttc 


tagggtcgac 


gttgacattg 


attattgact 


agttattaat 


agtaatcaat 


3060 


tacggggtca 


ttagttcata 


gcccatatat 


ggagttccgc 


gttacataac 


ttacggtaaa 


3120 
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Lyy CCCy CCl 


ggctgaccgc 


ccaacgaccc 


-rtta — a> a««« aM « *B» ^B» 

ccgcccattg 


•^l a**- a*m Ab> a***! "™W A— a_ a. 

aegtcaataa 


4— a*BB> a in fci a**. A— — ■ j 

tgacgt atgt 


3180 


r a-*—* 4— -** W 

ccccaLdgid 


acgccaacag 


ggact ttcca 


t tgacgt caa 


^ggg c 9g a g^ 


«bw Aa» A>« AW *\ a*BB aBBa* ^bb> AaBi a«. 

atttaeggea 


3240 


aactg cccac 


tuggcagcac 


at caagtgta 


teatatgeca 


agcacgcccc 


a**A 4b» — *, A> a**, ^bw afBMf A— 

ct at tgacgt 


3300 


^Bhn a^a. aBM ^daa* bbb. A< aana ^aa> aWa> 

caatgacggt 


aaatggcccg 


cctggcatta 


a^^ a^^ar ^*Bi ■ » ■ a» fc a^^r ^b^ -aaa 

tgcccagtac 


atgaccttat 


^BB> ^Baa au am ate BVB> #a> aV» ^BH .^aak 

gggactttcc 


^» ^ r\ 
3360 


tacttggcag 


tacatctacg 


tattagtcat 


- ■ ,_ar »■ * a« aiaw job 

cgctattacc 


«Bh Aa> — — - Aa> i— - * — - 

atggtgatgc 


ggttttggca 


3420 


JhA ' atfBi. «BB> .BBBB aB*. BBk. *BB 

gtacatcaat 


a» ■ u 4« _h _i— A» 

gggcgtggst 


yp-a ^b»» aw 4Wa» «V«> .r ■ a 

a 9 c 99 ttt 9 a 


etcaegggga 


cttccaagtc 


tccaccccat 


3480 


tgacgtcaat 


gggaguttgt 


^B> 4>l ^BM# >— Bf ^B> M «*B. 

tctggcacca 


aaatcaaegg 


l**»r ^> a^Ba Aa> Abv 4a» — — _ 

gactttccaa 


V b*b. Abb ^ha* 4a« aBMfl -*~ J #■ HB, av"*. 

aatgtcgtaa 


T f- y* l-» 

3540 


caaccccgcc 


cca tcgacgc 


aaatgggcgg 


taggcgtgta 


cggtgggagg 


tctatataag 


3600 


cagag ccccc 


CygCCaaCCcl 


CdyddCCCaC 


Lycccaccyy 


ctcaccgaaa 


t caacacgac 


3 oo 0 


tcacta tagg 


g ag acccaag 


/-^ 4~ |^ 1^*1 /*» /Vrt 

CuCCCCCjcly y 


ctagaatgaa 


gttgcctgt t 


*"* ^t/« a*** 4ak 4b> 4» aBB_ i- - 

aggctgttgg 


3 72 0 




ctygattcct 


ggegaggaga 


9 a 999 aa 9 c 9 


agggaggaga 


a tggacaggg 


T TO A 

3 780 


agcaggagca 


ctgaacccca 


ctgcccattc 


catgtatcug 


^bb> a*V a<B^ a«Bat 4a* a*h» B"^k 

geatgggega 


aBKBT #S a^Bh ^B—b aBB» Abb a*ka> afBaa aW)BB Aw 

gaagatgggt 


3840 


y- * la- A* — k A— _y** A*» — — 

cttatccccc 


agcatggggc 


ctctggggtg 


^Bk A - * 1 i*B— Aa> ^a> 

aatacttgtt 


a 9 a 999 a 99t 


tccagatggg 


"1 i"\ ft 

3900 


*"% ""a — % A* A"» »•"•■>■ y**— i 4— 

adCatyCyCt 


acaatgaaga 


tea tgaaatg 


a*"tf B^% a*^ a^J fBaata 

gatgcctggg 


atggtctaag 


taatgectta 


3960 


gaagtgacta 


gaeacttgea 


atccactttt 


tttggtaaga 


■ ■ — - ah. aB. 4bb- A_ A t a^a> bLjb 

agagattttt 


aggctataaa 


jt ***4 

4020 


aaaat.gtt.at 


gtaaaaataa 


acgatcacag 


^b> Aa> alB> a*« ^BV aB>& a*B. aaa, 

ttgaaataaa 


aaaaaaatat 


aaggatgttc 


_1 a*V a#*h i*K 

4080 


atgaattttg 


tgtataacta 


tgtatttctc 


tctcattgtt 


tcagcttcct 


taagecagat 


4140 


ccagttggtg 


cagtctggag 


ctgaggtgaa 


gaagcctgga 


gagacagtca 


agatctcctg 


4200 


caaggcttct 


gggtatacct 


tcacaaacta 


tggaatgaac 


tgggtg^gc 


agactccagg 


a a— w, _•*> a*9i 

4260 


aaagggttta 


aagtggatgg 


gctggataaa 


^b» «aw ^aw a#N_ «bb. m bvb. ^ak 

cacctacact 


^Bav «<t JPV b«b. anh, >aB aVBa a«B^ aam 

ggagaaccaa 


a"B» AW ak Ab> aVBaa b»bb Ab> ^Sa_, aH. 

catatgetga 


X "l ft ft 

4320 


t" <T O /"» t" t~ /fa a/~r 
tydCLUCddy 


ggacggcctg 


CCttCLCULU 


yyaaacctcc 


accagcactg 


>— — .fat AW A— > A_ Abb AW Jfm- a*W — > a. 

ectttttgea 


A T O ft 

4380 


gaLCaaCddC 


cucagaagtg 


aggacaegge 


tacatatt tc 


4v _r_i Ajb> ju ^b> aaH. . lj m 4> 

tgtgtaagat 


ttatttctaa 


4440 


ggggg a ctac 


tsgggtcaag 


gaacgtcagt 


caccgtctcc 


tcaggtaagc 


tttctggggc 


4500 


aggccaggcc 


tgacct: tggc 


^* ^» y^a«- ^-»a» a»-^ /bj j aa 

tttggggcag 


ggagggggct 


aaggtgaggc 


aggtggcgcc 


4560 


agccaggtgc 


acacccaatg 


cccatgagcc 


a^B- b*b. aiaa> a*Ba. Aa* i— at- afcal 

cagacactgg 


^bbi aVai aw Ab> a>a aSkt b*Jb. jbi jbi 

acgctgaacc 


tegeggacag 


a y ft ft 

4620 


t taagaaccc 


a 9999 cctct 


gcgccctggg 


aBBfc b*bbi «bv jai i* afBB Ab« — 

cccagctccg 


tcccacaccg 


eggtcacatg 


4680 


gcaccacctc 


tcttgcagcc 


tccaccaagg 


gcccatcggt 


cttccccctg 


gcaccctcct 


4740 


ccaagagcac 


ctctgggggc 


acagcggccc 


tsggctgect 


ggtcaaggac 


tacttccccg 


4800 


aaceggtgae 


ggtgtcgtgg 


aactcaggcg 


ccctgaccag 


cggcgtgcac 


accttcccgg 


4860 


ctgtcctaca 


gtcctcagga 


ctctactccc 


teagcagegt 


ggtgaccgtg 


ccctccagca 


4920 


gcttgggcac 


ccagacctac 


atetgeaacg 


tgaatcacaa 


gcccagcaac 


accaaggtgg 


4980 
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acaagagagt 


tggtgagagg 


ccagcacagg 


gagggagggt 

» 


gtctgctgga 


agccaggctc 


5040 


agcgctcctg 


cctggacgca 


tcccggctat gcagtcccag 


tccagggcag 


caaggcaggc 


5100 


cccgtctgcc 


tcttcacccg 


gaggcctctg 


cccgccccac 


tcatgctcag ggagagggtc 


5160 


ttctggcttt 


ttccccaggc 


tctgggcagg 


cacaggctag 


gtgcccctaa 


cccaggccct 


5220 


gcacacaaag 


gggcaggtgc 


tgggctcaga 


cctgccaaga gccatatccg ggaggaccct 


5280 


gcccctgacc 


taagcccacc 


ccaaaggcca 


aactctccac 


tccctcagct 


cggacacctt 


5340 


ctctcctccc 


agattccagt 


aactcccaat 


cttctctctg 


cagagcccaa 


atcttgtgac 


5400 


aaaactcaca 


catgcccacc 


gtgcccaggt 


aagccagccc 


aggcctcgcc 


ctccagctca 


5460 


aggcgggaca 


ggtgccctag 


agtagcctgc 


atccagggac 


aggccccagc 


cgggtgctga 


5520 


cacgtccacc 


tccatctctt 


cctcagcacc 


tgaactcctg gggggaccgt 


cagtcttcct 


5580 


cttcccccca 


aaacccaagg 


acaccctcat 


gatctcccgg 


acccctgagg 


tcacatgcgt 


5640 


ggtggtggac 


gtgagccacg 


aagaccctga 


ggtcaagttc 


aactggtacg 


tggacggcgt 


5700 


ggaggtgcat 


aatgccaaga 


caaagccgcg 


ggaggagcag 


tacaacagca 


cgtaccgtgt 


5760 


ggtcagcgtc 


ctcaccgtcc 


tgcaccagga 


ctggctgaat 


ggcaaggagt 


acaagtgcaa 


5820 


ggtctccaac 


aaagccctcc 


cagcccccat 


cgagaaaacc 


atctccaaag 


ccaaaggtgg 


5880 


gacccgtggg 


gtgcgagggc 


cacatggaca 


gaggccggct 


cggcccaccc 


tctgccctga 


5940 


gagtgaccgc 


tgtaccaacc 


tctgtcccta 


cagggcagcc 


ccgagaacca 


caggtgtaca 


6000 


ccctgccccc 


atcacgggag 


gagatgacca 


agaaccaggt 


cagcctgacc 


tgcctggtca 


6060 


aaggcttcta 


tcccagcgac 


atcgccgtgg 


agtgggagag 


caatgggcag 


ccggagaaca 


6120 


actacaagac 


cacgcctccc 


gtgctggact 


ccgacggctc 


cttcttcctc 


tatagcaagc 


6180 


tcaccgtgga 


caagagcagg 


tggcagcagg ggaacgtctt 


ctcatgctcc 


gtgatgcatg 


6240 


aggctctgca 


caaccactac 


acgcagaaga gcctctccct gtccccgggt 


aaagccccaa 


6300 


cttcaagttc 


tacaaagaaa 


acacagctgc 


aactggagca 


tctcctgctg 


gatctccaga 


6360 


tgattctgaa 


tggaattaac 


aactacaaga 


atcccaaact 


caccaggatg 


ctcacattca 


6420 


agttctacat 


gcccaagaag 


gccacagagc 


tcaaacatct 


ccagtgtcta 


gaggaggaac 


6480 


tcaaacctct 


ggaggaagtg 


ctaaacctcg 


ctcagagcaa 


aaacttccac 


ttaagaccta 


6540 


gggacttaat 

*_j ~j 


caqcaatatc 


aacgtaatag 


ttctggaact 


aaagggatcc 


gaaacaacat 


6600 


tcatgtgtga 


atatgctgat 


gagacagcaa 


ccattgtaga 


attcctaaac 


agatggatta 


6660 


ccttttgtca 


aagcatcatc 


tcaacactaa 


cttgataatt 


aagtgctcga 


gggatccaga 


6720 


catgataaga 


tacattgatg 


agtttggaca 


aaccacaact 


agaatgcagt 


gaaaaaaatg 


6780 
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CtCCdCtCyt 




aLyctat tyc 


UttatLLy ta 


dCC a L- Layoa 




oo4U 


acaagttaac 


aacaacaau l 


y— * ^ k A — > fr- 

gcacccdLLC 


4— r*i f-r ^ ^ 4* ^ * 

CacgULLCay 


g c c caggggg 


a 99 c 9^999 a 


oyuu 


4— 4-» 4«- ^* 4f- 4— — «i 

ggtxtttcaa 


agcaag t.aaa 


acc l ctacaa 


atgcgguacg 


y c u g ci 1. 1. a u g 


dtcccyccuc 


rncn 


gcgcgtttcg 


gtgatgacgg 


tgaaaacccc 


cgacacacgc 


age tecegga 


ydcyytcaca 


/ uz u 


gcttgtctgt 


aagcggatgc 


cgggagcaga 


caagcccgcc 


a 999 c 9cgtc 


*^ y^>> a*^p j^b> yw 4"* y*v 4 Bb A~ 

a 9cgggtgtt 


/UoU 


ggcgggtgtc 


9999 c 9 ca 9 c 


cacgacccag 


ccacgcagcg 


auageggage 


gcacaccggc 


A A 
/ I4U 


ttaactatgc 


ggcaccagag 


cagacugcac 


cgagagcgca 


cca to cgegg 


4— y^w 4— y**f — k ^ 4^ ^ 

cgcgaaacac 


/ zuv 


yffl jPW a"""* ^ % y»*i J ij 4^* yk| 

cgcacagatg 


cgcaaggaga 


dddldCCyCd 


ccaggcgcuc 


t~ <^ C+f*1 ♦* ^« 

LLLLyCttCC 


Lcgcccactg 


/-4DU 


acccgccgcg 


CCCyy uCyCC 


cggccgcggc 


gagcggcacc 


dyCCCaCLCd 


aaggcggcaa 


/ jzu 


tacggttatc 


cacagaatca 




caggaaagaa 


catgtgagca 


aaaggecage 


7 380 


aaaaggccag 


gaaccgcaaa 


aaggccgcgc 


tgcuggcgcc 


ti tccacagg 


cxccgccccc 


/44U 


c tgacgagca 


tcacaaaaac 


cgacgctcaa 


gucagaggcg 


gcgaaacccg 


acaggaccat 


7500 


aaagatacca 


ggcgc tuccc 


cccggaagcc 


ccctcgcgcg 


4*"> 4^ 4^ j !■ A— 4* 

cccccccgtt 


ccgaccctgc 


7560 


cgcc caccgg 


4— m — h, r A* * 

acaccuyLcc 


/ — f j>| y— ^ 4— +— 4— *—l /"i -n 

gccLLLCccc 


cttcgggaag 


C 9 t 99 c 9 ctt 


cctcaatgct 


r» <r 0 y\ 

7620 


cacgcugtag 


gcacctcagc 


^ c 99^-9 ta 99 


4^ a/v 4™ ^™ j*"^ 4* 

ccgcccgccc 


caagccgggc 


4" j**kj 4" 4" j*V jyk* jpq> yk 

tgcgtgcacg 


"t /* 0 r\ 
7680 


aaccccccgt- 


t. cagcccgac 


cgccgcgccu 


4* — v 4* y^*> AfV/W 4» a a 

cacccggcaa 


jm4> •^■k 4** yJ«y*j)4* 4/* 4" 

ctaLcgtccc 


gagtccaacc 


7740 


cggtaagaca 


cgactcaccg 


ccactggcag 


cagccactgg 


taacaggatt 


ageagagega 


7800 


jritaa* ^»k» *W fW y<S# dam j« «v 

ggtatgtagg 


cggtgccaca 


gagttctcga 


a 9tggtggcc 


taactaegge 


tacactagaa 


f-f 4**4 jf* yy\ 

7860 


ggacagtact 


4~ y t ^» 4— 4— 4* ^* 

uggtatccgc 


gccctgctga 


^■■t JHJ ~ J—m 4W 4W «K r - 1 

agccagttac 


etteggaaaa 


agagttggta 


7920 


gcuccxgatc 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


•W 4» aU. 4-t J— i_ JV- — - j_ Ay. 

tttttttgtt 


tgeaagcage 


7980 


agat cacgcg 


cagaaaaaaa 


ggatcccaag 


jiij k 4> jf^w j^^a 4r* 4>* ^» 

aagatcct-Ct 


>f«w 4y> jp-a 4y» 4jb 4W 4» ■ ■ «W 

gatcttttct 


aeggggtctg 


8040 




gaacgaaaac 


ccacgc taag 


y - ** 1 4~ 4~ 4~ 4"* st 4— 

ggatttcggt 


4~ y% *^ ^*_> 4> 4— a 

catgaga t ta 


tcaaaaagga 


8100 


t- f :a /~» /"» a 


yacccuttca 


aattaaaaat 


gaagc nccaa 


atcaatctaa 


agtatatatg 


8160 




yccuyaCayc 


taccaacgcL 


caaccagcga 


ggcacccatc 


teagegatet 


8220 


yCCLdLLUCy 


4» 4)* *\ 4— y^v >h 4** — * 


gccgcctgac 


tccccgtcgt 


gtagataact 


aegataeggg 


8280 


agggct cacc 


atctggcccc 


agtgctgcaa 


tgataccgcg 


agacccacgc 


tcaccggctc 


8340 


cagacccatc 


agcaataaac 


cagccagccg 


gaagggccga 


gcgcagaagt 


ggtcctgcaa 


8400 


4~ 4^ 4» ^% 4^ m 

ccLtdtccgc 


ccccatccag 


tctatt.aatt 


gttgccggga 


agctagagta 


agtagttcgc 


8460 


cagttaatag 


tttgcgcaac 


gttgttgcca 


ttgctgcagg 


catcgtggtg 


tcacgctcgt 


8520 


cgtttggtat 


ggcttcattc 


agctccggtt 


cccaacgatc 


aaggcgagtt 


acatgatccc 


8580 


ccatgttgtg 


caaaaaagcg 


gttagctcct 


tcggtcctcc 


gatcgttgtc 


agaagtaagt 


8640 
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tggccgcagt 


gttatcactc 


atggttatgg 


cagcactgca 


taattctctt 


actgtcatgc 


8700 


catccgtaag 


atgcttttct 


gtgactggtg 


agtactcaac 


caagtcattc 


tgagaatagt 


8760 


gtatgcggcg 


accgagttgc 


tcttgcccgg 


cgtcaacacg 


ggataatacc 


gcgccacata 


8820 


gcagaacttt 


aaaagtgctc 


atcattggaa 


aacgttcttc 


ggggcgaaaa 


ctctcaagga 


8880 


tcttaccgct 


gttgagatcc 


agttcgatgt 


aacccactcg 


tgcacccaac 


tgatcttcag 


8940 


catcttttac 


tttcaccagc 


gtttctgggt 


gagcaaaaac 


aggaaggcaa 


aatgccgcaa 


9000 


aaaagggaat 


aagggcgaca 


cggaaatgtt 


gaatactcat 


actcttcctt 


tttcaatatt 


9060 


attgaagcat 


ttatcagggt 


tattgtctca 


tgagcggata 


catatttgaa 


tgtatttaga 


9120 


aaaataaaca 


aataggggtt 


ccgcgcacat 


ttccccgaaa 


agtgccacct 


gacgtctaag 


9180 


aaaccattat 


tatcatgaca 


ttaacctata 


aaaataggcg 


tatcacgagg 


ccctttcgtc 


9240 


ttcaagaatt 


ccgatccaga 


catgataaga 


tacattgatg 


agtttggaca 


aaccacaact 


9300 


agaatgcagt 


gaaaaaaatg 


ctttatttgt 


gaaatttgtg 


atgctattgc 


tttatttgta 


9360 


accattagaa 


gctgcaataa 


acaagttaac 


aacaacaatt 


gcattcattt 


tatgtttcag 


9420 


gttcaggggg 


aggtgtggga 


ggttttttaa 


agcaagtaaa 


acctctacaa 


atgtggtatg 


9480 


gctgattatg 


atctaaagcc 


agcaaaagtc 


ccatggtctt 


ataaaaatgc 


atagctttcg 


9540 


gaggggagca 


gagaacttga 


aagcatcttc 


ctgttagtct 


ttcttctcgt 


agaccttaaa 


9600 


ttcatacttg 


attccttttt 


cctcctggac 


ctcagagagg 


acgcctgggt 


attctgggag 


9660 


aagtttatat 


ttccccaaat 


caatttctgg 


gaaaaacgtg 


tcactttcaa 


attcctgcat 


9720 


gatccttgtc 


acaaagagtc 


tgaggtggcc 


tggttgattc 


atggcttcct 


ggtaaacaga 


9780 


actgcctccg 


actatccaaa 


ccatgtctac 


tttacttgcc 


aattccggtt 


gttcaataag 


9840 


tcttaaggca 


tcatccaaac 


ttttggcaag 


aaaatgagct 


cctcgtggtg 


gttctttgag 


9900 


ttctctactg 


agaactatat 


taattctgtc 


ctttaaaggt 


cgattcttct 


caggaatgga 


9960 


gaaccaggtt 


ttcctaccca 


taatcaccag 


attctgttta 


ccttccactg 


aagaggttgt 


10020 


ggtcattctt 


tggaagtact 


tgaactcgtt 


cctgagcgga 


ggccagggtc 


ggtctccgtt 


10080 


cttgccaatc 


cccatatttt 


gggacacggc 


gacgatgcag 


ttcaatggtc 


gaaccatgag 


10140 


ggcaccaagc 


tagctttttg 


caaaagccta 


ggcctccaaa 


aaagcctcct 


cactacttct 


10200 


ggaatagctc 


axjaggccgag 


gcggcctcgg 


cctctgcata 


aataaaaaaa 


attaqtcaqc 


10260 


catggggcgg 


agaatgggcg 


gaactgggcg 


gagttagggg 


cgggatgggc 


ggagttaggg 


10320 


gcgggactat 


ggttgctgac 


taattgagat 


gcatgctttg 


catacttctg 


cctgctgggg 


10380 


agcctgggga 


ctttccacac 


ctggttgctg 


actaattgag 


atgcatgctt 


tgcatacttc 


10440 
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tgcctgctgg ggagcctggg gactttccac accctaactg acacacattc caca 104 94 



<210> 


41 


<2ll> 




<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


heavy chain- IL2 


<400> 


41 



Gin lie Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Glu 
15 10 15 

Thr Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 

20 25 30 

Gly Met Asn Trp Val Lys Gin Thr Pro Gly Lys Gly Leu Lys Trp Met 
35 40 45 

Gly Trp lie Asn Thr Tyr Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 60 

Lys Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Thr Ser Thr Ala Phe 
65 70 75 80 

Leu Gin lie Asn Asn Leu Arg Ser Glu Asp Thr Ala Thr Tyr Phe Cys 

85 90 95 

Val Arg Phe lie Ser Lys Gly Asp Tyr Trp Gly Gin Gly Thr Ser Val 

100 105 110 

Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala 
115 120 125 

Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu 
130 135 140 

Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly 
145 150 155 160 

Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser 

165 170 175 

Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu 

180 185 190 

Gly Thr Gin Thr Tyr He Cys Asn Val Asn His Lys Pro Ser Asn Thr 
195 200 205 

Lys Val Asp Lys Arg Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
210 215 220 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe 
225 230 235 240 

Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met He Ser Arg Thr Pro 

245 250 " 255 
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Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val 

260 265 270 

Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr 
275 280 285 

Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val 
290 295 300 

Leu Thr Val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys 
305 310 315 320 

Lys Val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser 

325 330 335 

Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro 

340 345 350 

Ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val 
355 360 365 

Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly 
370 375 380 

Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp 
385 390 395 400 

Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp 

405 410 415 

Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His 

420 425 430 

Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys Ala Pro 
435 440 445 

Thr Ser Ser Ser Thr Lys Lys Thr Gin Leu Gin Leu Glu His Leu Leu 
450 455 460 

Leu Asp Leu Gin Met lie Leu Asn Gly lie Asn Asn Tyr Lys Asn Pro 
465 470 475 480 

Lys Leu Thr Arg Met Leu Thr Phe Lys Phe Tyr Met Pro Lys Lys Ala 

485 490 495 

Thr Glu Leu Lys His Leu Gin Cys Leu Glu Glu Glu Leu Lys Pro Leu 

500 505 510 

Glu Glu Val Leu Asn Leu Ala Gin Ser Lys Asn Phe His Leu Arg Pro 
515 520 525 

Arg Asp Leu He Ser Asn He Asn Val He Val Leu Glu Leu Lys Gly 
530 535 540 

Ser Glu Thr Thr Phe Met Cys Glu Tyr Ala Asp Glu Thr Ala Thr He 
545 550 555 560 



Val Glu Phe Leu Asn Arg Trp He Thr Phe Cys Gin Ser He He Ser 

565 570 575 
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Thr Leu Thr 



<210> 42 
<211> 213 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> light chain 
<400> 42 

Glu lie val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Val Thr Leu Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met 

20 25 30 

Leu Trp Tyr Gin Gin Lys Pro Gly Ser Ser Pro Lys Pro Trp He Phe 
35 40 45 

Asp Thr Ser Asn Leu Ala Ser Gly Phe Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu He He Ser Ser Met Glu Ala Glu 
65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Gly Tyr Pro Tyr Thr 

85 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Thr Val Ala Ala Pro 

100 105 110 

Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr 
115 120 125 

Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys 
130 135 140 

Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly Asn Ser Gin Glu 
145 150 155 160 

Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser 

165 170 175 

Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala 

180 185 190 

Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe 
195 200 205 



Asn Arg Gly Glu Cys 
210 



